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Technical field 



5 The present invention relates to a novel polynucleotide sequence which encodes a novel polypeptide belonging to the 

class of proteins known as G-protein coupled receptors (GPCRs). The present invention also relates, inter alia, to 
processes for producing the polypeptide and its uses. 

Background of the invention 

10 

Cells and tissues respond to a wide variety of extracellular signalling molecules through the interaction of these 
molecules with specific cell-surface receptors. One such class of receptors are known as G-protein coupled receptors 
(GPCRs) and these are characterised by containing a series of 7 hydrophobic transmembrane segments. Upon binding an 
extracellular ligand to its receptor, intracellular signals are initiated via interactions with heterotrimeric G proteins 
15 which, in turn, can lead to a number of different intracellular events depending upon which receptor has been activated. 
For example some GPCRs influence adenyl cyclase activity whereas others act via phospholipase C. 

Members of the GPCR superfamity respond to a wide variety of ligands including small molecule amines (such as 
serotonin, dopamine, acetylcholine), lipid-derived mediators (such as LpA), amino acid derivatives (such as glutamate) and 
neurotransmitter peptides and hormones (such as neurokinin, galanin, glucagon, gastrin). Although GPCRs are activated 
20 by a broad range of ligands, it should be noted that individual GPCRs have a small and very specific repertoire of ligands. 
Based upon an analysis of the primary structure of a novel GPCR, it is now possible to classify them into specific sub- 
families, thereby narrowing the range of potential ligands. 

In many cases, the endogenous ligands of GPCRs are relatively small, enabling them to be mimicked or blocked by 
synthetic analogues. For example drugs such as prazosin, doxazosin, cimetidine, ranitidine are all effective antagonists of 
25 their respective target GPCRs. 

Thus, as the modulation of GPCRs can have therapeutic consequences, there is a continued need to provide new 
GPCRs and their associated agonists and antagonists. 



Summary aspects of the invention 

30 

In a broad aspect, the present invention relates to novel amino acid sequences. In this regard, a specific novel amino 
acid sequence has been isolated and it is to be understood that the invention covers that sequence as well as novel 
variants, fragments, derivatives and homologues thereof. 

In another broad aspect, the present invention relates to novel nucleic acid sequences. In this regard, a specific novel 
nucleic acid sequence has been isolated and it is to be understood that the invention covers that sequence as well as novel 
35 variants, fragments, derivatives arid homologues thereof. 

Thus, in brief, some aspects of the present invention relate to: 

1 . Novel amino acids. 
40 2. Novel nucleotide sequences. 

3. Assays using said novel sequences. 

4. Compounds/compositions identified by use of said assays. 

45 

5. Expression systems comprising or expressing said novel sequences. 

6. Methods of treatment based on said novel sequences. 

50 7. Pharmaceutical compositions based on said novel sequences. 



Other aspects concerning the amino acid sequence of the present invention and/or the nucleotide sequence of the 
present invention include: a construct comprising or capable of expressing the sequences of the present invention; a vector 
comprising or capable of expressing the sequences of the present invention; a plasmid comprising or capable of expressing 
the sequences of the present invention; a cell transfected or viratly-transduced with a construct/vector/plasmid comprising 
or capable of expressing the sequences of the present invention; a tissue comprising or capable of expressing the 
sequences of the present invention; an organ comprising or capable of expressing the sequences of the present invention; 
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a transformed host comprising or capable of expressing the sequences of the present invention; and a transformed 
organism comprising or capable of expressing the sequences of the present invention. The present invention also 
encompasses methods of expressing the same, such as expression in a micro-organism; including methods for transferring 
the same. 

5 For ease of reference, aspects of the present invention are now discussed under appropriate section headings. However, 

the teachings under each section are not necessarily limited to each particular section. 

In the following commentary references to "nucleotide sequence of the present invention" and "amino acid sequence of 
the present invention" refer respectively to any one or more of the nucleotide sequences presented or discussed herein and 
to any one or more of the amino acid sequences presented or discussed herein. Also, and as used herein, "amino acid 

10 sequence" refers to peptide or protein sequences and may refer to portions thereof. In addition, the term "amino acid 
sequence of the present invention" is synonymous with the phrase "polypeptide sequence of the present invention". Also, 
the term "nucleotide sequence of the present invention" is synonymous with the phrase "polynucleotide sequence of the 
present invention". 



Detailed aspects of the invention 

According to one aspect of the present invention, there is provided an isolated and/or purified polynucleotide comprising 
one or more of: 

(a) a polynucleotide encoding the polypeptide as set forth in SEQ ID NO: 2; 

(b) a polynucleotide comprising a nucleotide sequence of SEQ ID NO: 1; 

(c) a polynucleotide encoding the polypeptide expressed by the DNA contained in NCIMB 41068; 



25 (d) a polynucleotide comprising a nucleotide sequence that has at least 70% identity to the polynucleotide of any one 

of (a) to (c); 

(e) a polynucleotide comprising a nucleotide sequence which is capable of hybridising to the polynucleotide of anyone 
of (a) to (d); 

30 

(f) a complement to the polynucleotide of any one of (a) to (e); or 

(g) a polynucleotide fragment of the polynucleotide of any one of (a) to (f). 



Preferably, the poiyniicleotide 'comprises a nucleotide sequence that has at least 75% identity to the polynucleotide of 
any one of (a) to (cj. More preferably, the polynucleotide comprises a nucleotide sequence that has at least 80% identity to 
the polynucleotide of any one of (a) to (c). Even more preferably, the polynucleotide comprises a nucleotide sequence that 
has at least 85% identity to the polynucleotide of any one of (a) to (c). Yet more preferably, the polynucleotide comprises a 
nucleotide sequence that has at least 90% identity to the polynucleotide of any one of (a) to (c). More preferably, the 
polynucleotide comprises a nucleotide sequence that has at least 95% identity to the polynucleotide any one of (a) to (c). 
Most preferably, the polynucleotide comprises a nucleotide sequence that has at least 98% identity to the polynucleotide 
any one of (a) to (c). 

The polynucleotide described above preferably encodes a G-protein coupled receptor (GPCR). 

The present invention also provides a polynucleotide probe or primer comprising at least 15 contiguous nucleotides of 
the polynucleotide described above. 

The present invention yet further provides a vector comprising the polynucleotide described above. 

According to a further aspect of the present invention, there is provided a host cell transformed or transfected with the 
vector described above. Preferably, the host cell is a mammalian, insect, fungal, bacterial or yeast cell. 

According to a further aspect of the present invention, there is provided the transcribed RNA product of the 
polynucleotide described above. There is also provided an RNA molecule or a fragment thereof which is antisense in 
relation to the RNA product and is capable of hybridising thereto. 

There is yet further provided a ribozyme or zinc finger protein capable of binding to the polynucleotide described above. 

According to yet a further aspect of the present invention, there is provided a process for producing a polypeptide or 
fragment thereof comprising culturing said host cell under conditions sufficient for the expression of said polypeptide or 
fragment. Preferably, said polypeptide or fragment is expressed at the surface of said cell. The process preferably further 
includes recovering the polypeptide or fragment from the culture. 

There is also provided by the present invention a process for producing cells capable of expressing a polypeptide or 
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fragment thereof comprising transforming or transfecting cells with the vector described above. 

According to a further embodiment of the present invention, there are provided cells produced by the process described 
above. There is also provided a membrane preparation of said cells. 

According to another aspect of the present invention, there is provided a polypeptide comprising: 

(a) a polypeptide having the deduced amino acid sequence translated from the polynucleotide sequence in SEQ ID 
NO: 1 and variants, fragments, homologues, analogues and derivatives thereof; 

(b) a polypeptide of SEQ ID NO: 2 and variants, fragments, homologues, analogues and derivatives thereof; or 

(c) a polypeptide encoded by the cDNA of NCIMB 41068 and variants, fragments, homologues, analogues and 
derivatives thereof. 



j 5 There is also provided by the present invention an antibody against the polypeptide described above. 

The present invention yet further provides a compound, which modulates the polypeptide described above. Preferably, 
the compound antagonises or selectively antagonises the polypeptide. Alternatively, the compound agonises the 
polypeptide. 

Also provided by the present invention is a pharmaceutical composition comprising the antibody or compound described 
20 above and one or more pharmaceutical^ acceptable carriers, diluents, adjuvants or excipients. 

According to another aspect of the present invention, there is provided a method for identifying a compound, which 
binds to and modulates the polypeptide described above comprising contacting said polypeptide with a candidate 
compound and determining whether modulation occurs. 

Preferably, said method comprises: 



(a) contacting a compound with cells expressing the potypeptide described above on their cell surface, said 
polypeptide being associated with a second component capable of providing a detectable signal in response to the 
binding of a compound to said polypeptide; said contacting being under conditions sufficient to permit binding of 
compounds to the polypeptide; and 

(b) identifying a compound capable of polypeptide binding by detecting the signal produced by said second 
component. 

Alternatively, said method comprises: 

(a) contacting fi) a detectable first component known to bind to the polypeptide described above and (ii) a compound, 
with cells expressing the above polypeptide on their cell surface, said polypeptide being associated with a 
second component capable of providing a detectable signal in response to the binding of a compound to said 
polypeptide; said contacting being under conditions sufficient to permit binding of compounds to the polypeptide; and 

(b) determining whether the first component binds to the polypeptide by detecting the absence or otherwise of a signal 
generated from the interaction of the first component with the polypeptide. 



45 The compound identified by any of the above methods preferably binds to and (i) antagonises or selectively antagonises 
the polypeptide described above, or (ii) agonises the polypeptide described above. 

As GPCRs are involved in signal transduction, modulators (e.g. agonists or antagonists) of the polypeptide of the 
present invention can find use in interfering in the signal transduction process. 

Therefore, according to yet another embodiment of the present invention, there is provided the antibody, compound or 
50 composition described above for use as a pharmaceutical. 

Such antibodies, compounds and compositions, which can modulate the polypeptide of the present invention, can 
therefore find use in the therapeutic areas which concern aspects of signal transduction. Therapeutically usefully areas 
include, but are not limited to, diabetes and metabolic disease, neurological disease, psychotherapeutics, urogenital 
disease, reproduction and sexual medicine, inflammation, cancer, tissue repair, dermatology, skin pigmentation, 
55 photoageing, frailty, osteoporosis, cardiovascular disease, gastrointestinal disease, antiinfection, allergy and respiratory 
disease, sensory organ disorders, steep disorders and hairioss. 

Accordingly, there is also provided the use of the compound described above in the manufacture of a medicament for 
the treatment of a patient having need to modulate the polypeptide described above. Preferably, the treatment is for a 
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patient having need to antagonise or selectively antagonise the polypeptide. Alternatively, the treatment is for a patient 
having need to agonise the polypeptide. 

According to yet a further aspect of the invention, there is provided a method for the treatment of a patient having need 
to modulate the polypeptide described above comprising administering to the patient a therapeutically effective amount of 
5 the above-described compound. Preferably, said method is for the treatment of a patient having need to antagonise or 
selectively antagonise the polypeptide. Alternatively, said method is for the treatment of a patient having need to agonise 
the polypeptide. 

Preferably, said compound is a polypeptide and a therapeutically effective amount of the compound is administered by 
providing to the patient DNA encoding said compound and expressing said compound in vivo. 

10 There is also provided by the present invention use of the antibody described above in the manufacture of a medicament 
for the treatment of a patient having need to modulate the polypeptide described above. Preferably, said method is for the 
treatment of a patient having need to antagonise or selectively antagonise the polypeptide. Alternatively, said method is for 
the treatment of a patient having need to agonise the polypeptide. 

Yet further provided by the present invention is a method for the treatment of a patient having need to modulate the 

15 polypeptide described above, comprising administering to the patient a therapeutically effective amount of the antibody 
described above. Preferably, said method is for the treatment of a patient having need to antagonise or selectively 
antagonise the polypeptide. Alternatively, said method is for the treatment of a patient having need to agonise the 
polypeptide. 

The present invention also provides use of the compound described above in the manufacture of a medicament for the 
20 treatment of metabolic disease. 

In a preferred embodiment, the present invention also provides use of the compound described above in the 
manufacture of a medicament for the treatment of diabetes. 

Yet further provided by the present invention is use of the antibody described above in the manufacture of a medicament 
for the treatment of metabolic disease. 
25 Also provided by the present invention is use of the antibody described above in the manufacture of a medicament for 

the treatment of diabetes. 

The present invention also provides a method for the treatment of metabolic disease in a patient comprising 
administering to the patient a therapeutically effective amount of the compound described above. 

In a preferred embodiment, the present invention also provides a method for the treatment of diabetes in a patient 
30 comprising administering to the patient a therapeutically effective amount of the compound described above. 

Yet further provided by the present invention is a method for the treatment of metabolic disease in a patient comprising 
administering to the patient a therapeutically effective amount of the antibody described above. 

Also provided by the present invention is a method for the treatment of diabetes in a patient comprising administering to 
the patient ~a therapeutically effective amount of the antibody described above. 
35 According to yet a further aspect of the present invention, there are. provided cells genetically engineered ex vivo or in 

vivo to overexpress, vnderexpress or to exhtbif targeted insertion or deletion of the^pplypeptide of the present invention. 
Preferably, said cells genetically engineered in vivo (and the other cells described above) are comprised within a non- 
human animal. 

40 PFI-007 POLYPEPTIDE 

As explained above, the present invention relates to a novel GPCR - which has been internally designated PFI-007 - and 
to a nucleotide sequence encoding same. The present invention also relates to the use of the novel nucleic acid and amino 
acid sequences in the diagnosis and treatment of disease. The present invention also relates to the use of the novel nucleic 

45 acid and amino acid sequences to evaluate and/or to screen for agents that can modulate the GPCR. The present invention 
further relates to genetically engineered host cells that comprise or express the novel nucleic acid and amino acid 
sequences to evaluate and/or to screen for agents that can modulate the GPCR. 

The PFI-007 polypeptide may be the same as the naturally occurring form - for this aspect, preferably the PFI-007 
polypeptide is the non-native amino acid sequence (i.e. it is not present in its natural environment) - or is a variant, 
homologue, fragment or derivative thereof. In addition, or in the alternative, the PFI-007 polypeptide is isolated PFI-007 
polypeptide and/or purified PFI-007 polypeptide. The PFI-007 polypeptide can be obtainable from or produced by any 
suitable source, whether natural or not, or it may be synthetic, semi-synthetic or recombinant. 

The PFI-007 coding sequence may be the same as the naturally occurring form - for this aspect, preferably the PFI-007 
coding sequence is the non-native nucleotide sequence (ie. it is not present in its natural environment) - or is a variant, 
homologue, fragment or derivative thereof. In addition, or in the alternative, the PFI-007 coding sequence is an isolated PFI- 

55 007 coding sequence and/or a purified PFI-007 coding sequence. The PFI-007 coding sequence can be obtainable from or 
produced by any suitable source, whether natural or not, or it may be synthetic, semi-synthetic or recombinant. 
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PFI-007 POLYPEPTIDE AND SCREENING 



The PFI-007 polypeptide and/or its coding sequence and/or a sequence capable of hybridising thereto is/are useful for 
testing the selectivity of drug candidates between different GPCRs. 
5 It has been demonstrated (herein) that PFI-007 is related to aminergic receptors and is therefore a novel GPCR likely to 

have a small molecule, amine-like ligand. 

Drugs that modulate the novel PFI-007 receptor will therefore be likely to modulate signal transduction processes. It is 
therefore likely that modulators of the PFI-007 receptor may be useful for the treatment of many different disorders 
associated with signal transduction that will most likely include, but is not limited to, diabetes and metabolic disease, 
10 neurological disease, psychotherapeutics, urogenital disease, reproduction and sexual medicine, inflammation, cancer, 
tissue repair, dermatology, skin pigmentation, photoageing, frailty, osteoporosis, cardiovascular disease, gastrointestinal 
disease, antiinfection, allergy and respiratory disease, sensory organ disorders, sleep disorders and hairloss. 

Thus, the PFI-007 polypeptide and/or its coding sequence and/or a sequence capable of hybridising thereto may be 
useful for screening drug candidates for the treatment of diseases associated with signal transduction. In addition, it is 
)5 believed that PFI-007 polypeptide and/or its coding sequence and/or a sequence capable of hybridising thereto may be 
useful for screening drug candidates for the treatment of diseases such as those described above. 

Either or both of the nucleotide sequences coding for the PFI-007 polypeptide or the PFI-007 polypeptide itself may be 
used to screen for agents that can affect GPCR activity. In particular, the nucleotide sequence coding for the PFI-007 
receptor itself may be used to screen for agents that can antagonise GPCR activity. In addition, the nucleotide sequence 
20 coding for PFI-007 polypeptide or the PFI-007 polypeptide itself may be used to screen for agents that selectively affect 
GPCR activity, such as selectively antagonise the PFI-007 receptor. 

POLYPEPTIDE OF THE PRESENT INVENTION 



25 



30 



The term "polypeptide" - which is interchangeable with the term "protein" - includes single-chain potypeptide molecules 
as well as multiple-polypeptide complexes where individual constituent polypeptides are linked by covalent or non-covalent 
means. 

Preferably, the polypeptide of the present invention is a single-chain polypeptide. 

Polypeptides of the present invention may be in a substantially isolated form. It will be understood that the polypeptide 
may be mixed with carriers or diluents which will not interfere with the intended purpose of the polypeptide and still be 
regarded as substantially isolated. A polypeptide of the present invention may also be in a substantially purified form, in 
which case it will generally comprise the polypeptide in a preparation in which more than 90%, e.g. 95%, 98% or 99% of the 
polypeptide in the preparation is a polypeptide of the present invention. Polypeptides of the present invention may be 
modified for example by the addition of histidine residues to assist their purification. 

Polypeptides of the present invention may be produced by synthetic means (e.g. as described by Geysen et a!., 1996) 
^ Of :recombinantly ( ' as described beJow. 

tn a preferred embodiment, the amino acid sequence perse of the present invention does riot cover the native PFI-007 
receptor according to the present invention when it is in its natural environment and when it has been expressed by its 
native nucleotide coding sequence which is also in its natural environment and when that nucleotide sequence is under the 
control of its native promoter which is also in its natural environment. For ease of reference, we have called this preferred 
40 embodiment the "non-native amino acid sequence". 

The terms "variant", "homologue", "fragment", "analogue" or "derivative" in relation to the amino acid sequence for the 
polypeptide of the present invention include any substitution of, variation of, modification of, replacement of, deletion of 
or addition of one (or more) amino acid from or to the sequence providing the resultant polypeptide has GPCR activity, 
preferably being at least as biologically active as the polypeptide shown in attached SEQ ID NO: 2. In particular, the term 
45 "homologue" covers homology with respect to structure and/or function. With respect to sequence homology, there is at 
least 70%, preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 
90%, more preferably at least 95% homology to the sequence shown in SEQ ID NO: 2. Most preferably there is at least 
98% homology to the sequence shown in SEQ ID NO: 2. 

Typically, for the variant, homologue or fragment of the present invention, the types of amino acid substitutions that 
50 could be made should maintain the hydrophobicity/hydrophilictty of the amino acid sequence. Amino acid substitutions may 
be made, for example from 1, 2 or 3 to 10, 20 or 30 substitutions provided that the modified sequence retains the ability to 
act as a GPCR in accordance with the present invention. Amino acid substitutions may include the use of non-naturally 
occurring analogues. 

The amino acid sequence of the present invention may be produced by expression of a nucleotide sequence coding for 
55 same in a suitable expression system. 

In addition, or in the alternative, the protein itsetf could be produced using chemical methods to synthesize a PFI-007 
polypeptide, in whole or in part. For example, peptides can be synthesized by solid phase techniques, cleaved from the 
resin, and purified by preparative high performance liquid chromatography (e.g. Creighton (1983) Proteins Structures and 
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Molecular Principles, WH Freeman and Co., New York, NY, USA). The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g. the Edman degradation procedure). 

Direct peptide synthesis can be performed using various solid-phase techniques (Roberge JY et at Science Vol 269 1995 
202-204) and automated synthesis may be achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer) 
5 in accordance with the instructions provided by the manufacturer. Additionally, the amino acid sequence of PFI-007, or any 
part thereof, may be altered during direct synthesis and/or combined using chemical methods with a sequence from other 
subunits, or any part thereof, to produce a variant polypeptide. 

In another embodiment of the invention, a PFI-007 natural, modified or recombinant amino acid sequence may be ligated 
to a heterologous sequence to encode a fusion protein. For example, for screening of libraries for compounds and peptide 
10 agonists and antagonists of PFI-007 GPCR activity, it may be useful to encode a chimeric PFI-007 protein expressing a 
heterologous epitope that is recognised by a commercially available antibody. A fusion protein may also be engineered to 
contain a cleavage site located between a PFI-007 sequence and the heterologous protein sequence, so that the PFI-007 
may be cleaved and purified away from the heterologous moiety. 

PFI-007 may also be expressed as a recombinant protein with one or more additional polypeptide domains added to 
15 facilitate protein purification. Such purification facilitating domains include, but are not limited to, metal chelating 
peptides such as histidine-tryptophan modules that allow purification on immobilised metals (Porath J, Protein Expr Purif 
Vol 3 1992 p263-281), protein A domains that allow purification on immobilised immunoglobulin, and the domain utilised in 
the FLAGS extension/affinity purification system (Immunex Corp, Seattle, WA, USA). The inclusion of a cleavable linker 
sequence such as Factor XA or enterokinase (Invttrogen, San Diego, CA, USA) between the purification domain and PFI- 
007 is useful to facilitate purification. 

A specific amino acid sequence of PFI-007 is shown in SEQ ID NO: 2. However, the present invention encompasses 
amino acid sequences encoding other GPCRs which would include amino acid sequences having at least 70% identity 
(preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, more 
preferably at least 95%, most preferably at least 98% identity) to that specific amino acid sequence. 

Polypeptides of the present invention also include fragments of the present amino acid sequence and variants thereof. 
Suitable fragments will be at least 5, e.g. at least 10, 12, 15 or 20 amino acids in size. 

Polypeptides of the present invention may also be modified to contain one or more (e.g. at least 2, 3, 5, or 10) 
substitutions, deletions or insertions, including conserved substitutions. These aspects are discussed in a later section. 
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NUCLEOTIDE SEQUENCE OF THE PRESENT INVENTION 



The term "nucleotide sequence" as used herein refers to an oligonucleotide sequence or polynucleotide sequence, and 
variants, homologues, fragments, analogues and derivatives thereof (such as portions thereof). The nucleotide sequence 
may be DNA or RNA which may be of genomic or synthetic or recombinant origin which may-be double-stranded 6> single? 
stranded whether representing the sense or antisense strand. 
35 Preferably, the tern? "nucleotide sequence" means DNA. 

More preferably, the term "nucleotide sequence" means DNA prepared by use of recombinant DNA techniques (i.e. 
recombinant DNA). 

In a preferred embodiment, the nucleotide sequence perse of the present invention does not cover the native nucleotide 
coding sequence according to the present invention in its natural environment when it is under the control of its native 
promoter which is also in its natural environment. For ease of reference, we have called this preferred. embodiment the 
"non-native nucleotide sequence". 

The nucleotide sequences of the present invention may include within them synthetic or modified nucleotides. A number 
of different types of modification to oligonucleotides are known in the art. These include methylphosphonate and 
phosphorothioate backbones, addition of acridine or polylysine chains at the 3* and/or 5* ends of the molecule. For the 
45 purposes of the present invention, it is to be understood that the nucleotide sequences described herein may be modified 
by any method available in the art. Such modifications may be earned out in to enhance the in vivo activity or life span 
of nucleotide sequences of the present invention. 

The present invention also encompasses nucleotide sequences that are complementary to the sequences presented 
herein, or any variant, homologue, analogue, fragment or derivative thereof. If the sequence is complementary to a 
50 fragment thereof then that sequence can be used a probe to identify similar coding sequences in other organisms, etc. 

The present invention also encompasses nucleotide sequences that are capable of hybridising to the sequences 
presented herein, or any variant homologue, analogue, fragment or derivative thereof. 

The present invention also encompasses nucleotide sequences that are capable of hybridising to the sequences that are 
complementary to the sequences presented herein, or any variant homologue, analogue, fragment or derivative thereof. 
55 The term "variant" also encompasses sequences that are complementary to sequences that are capable of hydridising to 
the nucleotide sequences presented herein. 

Preferably, the term "variant" encompasses sequences that are complementary to sequences that are capable of 



-7- 



EP 1 092 727 A2 

hydridising under stringent conditions (e.g. 65 W C and O.lxSSC {1xSSC = 0.15 M NaCI. 0.015 Na 3 citrate pH 7.0}) to the 
nucleotide sequences presented herein. 

The present invention also relates to nucleotide sequences that can hybridise to the nucleotide sequences of the 
present invention (including complementary sequences of those presented herein). 

The present invention also relates to nucleotide sequences that are complementary to sequences that can hybridise to 
the nucleotide sequences of the present invention (including complementary sequences of those presented herein). 

Also included within the scope of the present invention are polynucleotide sequences that are capable of hybridising to 
the nucleotide sequences presented herein under conditions of intermediate to maximal stringency. 

In a preferred aspect, the present invention covers nucleotide sequences that can hybridise to the nucleotide sequence 
of the present invention, or the complement thereof, under stringent conditions (e.g. 65°C and O.lxSSC). 

Exemplary nucleic acids can alternatively be characterised as those nucleotide sequences which encode a PFI-007 
protein and hybridise to the DNA sequence shown in SEQ ID NO: 1. Preferred are such sequences encoding PFI-007 
which hybridise under high-stringency conditions to the sequence shown in SEQ ID NO: 1 or the complement thereof. 

Advantageously, the invention provides nucleic acid sequences which are capable of hybridising, under stringent 
conditions, to a fragment of the sequence shown in the SEQ ID NO: 1 or the complement thereof. Preferably, the fragment 
is between 15 and 50 bases in length. Advantageously, it is about 25 bases in length. 

The terms "variant", "homologue", "analogue", "derivative" or "fragment" in relation to the nucleotide sequence coding for 
the preferred polypeptide of the present invention include any substitution of, variation of, modification of, replacement of, 
deletion of or addition of one (or more) nucleic acid from or to the sequence providing the resultant nucleotide sequence 
codes for or is capable of coding for an polypeptide having PFI-007 receptor activity, preferably being at least as 
biologically active as the polypeptide encoded by the sequence shown in SEQ ID NO: 1. In particular, the term 
"homologue" covers homology with respect to structure and/or function providing the resultant nucleotide sequence codes 
for or is capable of coding for a polypeptide having activity as a PFI-007 GPCR. With respect to sequence homology, 
preferably there is at least 70%, preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more 
preferably at least 90%, more preferably at least 95% homology to a nucleotide sequence coding for the amino acid 
sequence shown in SEQ ID NO: 2. Most preferably there is at least 98% homology to a nucleotide sequence coding for the 
amino acid sequence shown in SEQ ID NO: 2. With respect to sequence homology, there is at least 70%, preferably at 
least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, more preferably at 
least 95% homology to the nucleotide sequence shown in SEQ ID NO: 1 . Most preferably there is at least 98% homology to 
the nucleotide sequence shown in SEQ ID NO: 1 . 

As indicated, the present invention relates to a DNA sequence (preferably a cDNA sequence) encoding PFI-007. In 
particular, the present invention relates to cDNA sequences encoding PFI-007. 

The present invention also relates to DNA segments comprising the DNA sequence shown in SEQ ID NO: 1 or allelic 
variation^ thereof^ 

The present invention also relates to polypeptides produced by expression in a host qell into which has been 
incorporated; the foregoing DNA sequences or allelic variations thereof. 

The present invention also relates provides DNA comprising the DNA sequence shown in SEQ ID NO: 1 or allelic 
variations thereof. 

The present invention also relates to non-native DNA comprising the DNA sequence shown in SEQ ID NO: 1 or allelic 
variations thereof. 

A highly preferred aspect of the present invention relates to recombinant DNA comprising the DNA sequence shown in 
SEQ ID NO: 1 or allelic variations thereof. 

Polynucleotides of the present invention include nucleic acid sequences encoding the polypeptides of the present 
invention. It will be appreciated that a range of different polynucleotides encode a given amino acid sequence as a 
consequence of the degeneracy of the genetic code. 

By knowledge of the amino acid sequences set out herein it is possible to devise partial and full-length nucleic acid 
sequences such as cDNA and/or genomic clones that encode the polypeptides of the present invention. For example, 
polynucleotides of the present invention may be obtained using degenerate polymerase chain reaction (PCR) which will 
use primers designed to target sequences encoding the amino acid sequences presented herein. The primers will typically 
contain multiple degenerate positions. However, to minimise degeneracy, sequences will be chosen that encode regions of 
the amino acid sequences presented herein containing amino acids such as methionine which are coded for by only one 
triplet. In addition, sequences will be chosen to take into account codon usage in the organism whose nucleic acid is used 
as the template DNA for the PCR procedure. PCR will be used at stringency conditions lower than those used for cloning 
sequences with single sequence (non-degenerate) primers against known sequences. 

Nucleic acid sequences obtained by PCR that encode polypeptide fragments of the present invention may then be used 
to obtain larger sequences using hybridisation library screening techniques. For example a PCR clone may be labelled with 
radioactive atoms and used to screen a cDNA or genomic library from other species, preferably other mammalian species. 
Hybridisation conditions will typically be conditions of medium to high stringency (for example 0.03M sodium chloride and 
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0.03M sodium citrate at from about 50*C to about 60*C). 

Degenerate nucleic acid probes encoding all or part of the amino acid sequence may also be used to probe cDNA 
and/or genomic libraries from other species, preferably other mammalian species. However, it is preferred to carry out PCR 
techniques initially to obtain a single sequence for use in further screening procedures. 

5 In accordance with the present invention, polynucleotide sequences which encode PFI-007, fragments of the 

polypeptide, fusion proteins or functional equivalents thereof, may be used to generate recombinant DNA molecules that 
direct the expression of PFI-007 in appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA 
sequences which encode substantially the same or a functionally equivalent amino acid sequence, may be used to clone 
and express PFI-007. As will be understood by those of skill in the art, it may be advantageous to produce PFI-007- 

f 0 encoding nucleotide sequences possessing non-naturally occurring codons. Codons preferred by a particular prokaryotic or 
eukaryotic host (Murray E et at (1989) Nuc Acids Res 1 7:477-508) can be selected, for example, to increase the rate of PFI- 
007 expression or to produce recombinant RNA transcripts having desirable properties, such as a longer half-life, than 
transcripts produced from naturally occurring sequence. 

Polynucleotide sequences of the present invention obtained using the techniques described above may be used to 

15 obtain further homologous sequences and variants using the techniques described above. They may also be modified for 
use in expressing the polypeptides of the present invention in a variety of host cells systems, for example to optimise codon 
preferences for a particular host cell in which the polynucleotide sequences are being expressed. Other sequence changes 
may be desired in order to introduce restriction enzyme recognition sites, or to alter the property or function of the 
polypeptides encoded by the polynucleotides. 

2 q Altered PFI-007 polynucleotide sequences which may be used in accordance with the invention include deletions, 

insertions or substitutions of different nucleotide residues resulting in a polynucleotide that encodes the same or a 
functionally equivalent PFI-007. The protein may also have deletions, insertions or substitutions of amino acid residues 
which produce a silent change and result in a functionally equivalent PFI-007. Deliberate amino acid substitutions may be 
made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues as long as the biological activity of PFI-007 is retained. For example, negatively charged amino acids 

25 incjude aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with 
uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, 
asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine. 

Included within the scope of the present invention are alleles of PFI-007. As used herein, an "allele" or "allelic 
sequence" is an alternative form of PFI-007. Alleles result from a mutation, i.e. a change in the nucleic acid sequence, and 

30 generally produce altered mRNAs or polypeptides whose structure or function may or may not be altered. Any given gene 
may have none, one or many allelic forms. Common mutational changes which give rise to alleles are generally ascribed to 
deletions, additions or substitutions of amino acids. Each of these types of changes may occur alone, or in combination 
with the others, one or more times in a given sequence. 

The nucleotide sequences of the present invention may be engineered in order to alter a PFI-007 coding sequence for a 

2$ variety of reasons, facfudihg^but not limited to, Alterations whiclv modify the cloning, processing and/or -expression ef :the 
gene product. For example, mutations may be introduced using techniques which are well known in the art, e.g. site- 
directed mutagenesis to insert new restriction sites, to alter glycosylation patterns or to change codon preference. 

Polynucleotides of the present invention may be used to produce a primer, e.g. a PCR primer, a primer for an alternative 
amplification reaction, a probe e.g. labelled with a revealing label by conventional means using radioactive or non- 
40 radioactive labels, or the polynucleotides may be cloned into vectors. Such primers, probes and other fragments will be at 
least 15, preferably at least 20, for example at least 25, 30 or 40 nucleotides in length, and are also encompassed by the 
term polynucleotides of the present invention as used herein. 

Polynucleotides or primers of the present invention may carry a revealing label. Suitable labels include radioisotopes 
such as 32 aP or ^aS, enzyme labels, or other protein labels such as biotin. Such labels may be added to polynucleotides 

45 or primers of the present invention and may be detected using by techniques known in the art. 

Polynucleotides such as a DNA polynucleotide and primers according to the present invention may be produced 
recombinantty, synthetically, or by any means available to those of skill in the art. They may also be cloned by standard 
techniques. 

In general, primers will be produced by synthetic means, involving a step-wise manufacture of the desired nucleic acid 
50 sequence one nucleotide at a time. Techniques for accomplishing this using automated techniques are readily available in 
the art. 

Longer polynucleotides will generally be produced using recombinant means, for example using PCR cloning 
techniques. This will involve making a pair of primers (e.g. of about 15-30 nucleotides) to a region of the nucleotide 
sequence which it is desired to done, bringing the primers into contact with mRNA or cDNA obtained from a eukaryotic or 
55 prokaryotic cell, performing a polymerase chain reaction under conditions which bring about amplification of the desired 
region, isolating the amplified fragment (e.g. by purifying the reaction mixture on an agarose gel) and recovering the 
amplified DNA. The primers may be designed to contain suitable restriction enzyme recognition sites so that the amplified 
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DNA can be cloned into a suitable cloning vector. 

DNA molecules may be modified to increase intracellular stability and half-life. Possible modifications include, but are 

not limited to, the addition of flanking sequences of the 5' and/or 3' ends of the molecule or the use of phosphorothioate or 

2' O-methyf rather than phosphodiesterase linkages within the backbone of the molecule. 
5 As mentioned earlier, the present invention also relates to nucleotide sequences that are capable of hybridising to all or 

part of the sequence shown in SEQ ID NO: 1 or an allelic variation thereof. These nucleotide sequences may be used in 

antisense techniques to modify PFI-007 expression. Alternatively, these sequences (or portions thereof) can be used as a 

probe, or for amplifying all or part of such sequence when used as a PGR primer. 

In addition to the recombinant DNA sequences, genomic sequences are also of utility in the context of drug discovery. It 
10 may be valuable to inhibit the mRNA transcription of a particular isoform rather than to inhibit its translated protein. This 

may be true with PFI-007, if there are splice variants and wherein those different splice variants may be transcribed from 

different promoters. 

Another utility of the invention is that the DNA sequences, once known, give the information needed to design assays to 
specifically detect isoforms or splice variants. Isoform-specific PCR primer pairs are but one example of an assay that 

i 5 depends completely on the knowledge of the specific DNA sequence of the isoform or splice variant. Such an assay allows 
detection of mRNA for the isoform to access the tissue distribution and biological relevance of each isoform to a particular 
disease state. It also allows identification of cell lines that may naturally express only one isoform - a discovery that 
might obviate the need to express recombinant genes. If specific PFI-007 isoforms are shown to be associated with a 
particular disease state, the invention would be valuable in the design of diagnostic assays to detect the presence of 

20 isoform mRNA. 

An abnormal level of nucleotide sequences encoding a PFI-007 receptor in a biological sample may reflect a 
chromosomal aberration, such as a nucleic acid deletion or mutation. Accordingly, nucleotide sequences encoding a PFI- 
007 receptor provide the basis for probes which can be used diagnostically to detect chromosomal aberrations such as 
deletions, mutations or chromosomal translocations in the gene encoding PFI-007. PFI-007 gene expression may be altered 
25 in such disease states or there may be a chromosomal aberration present in the region of the gene encoding PFI-007. 

In an alternative embodiment of the invention, the coding sequence of PFI-007 could be synthesized, in whole or in part, 
using chemical methods well known in the art (see Caruthers MH et al (1980) Nuc Acids Res Symp Ser 215-23, Horn T et 
al (1980) Nuc Acids Res Symp Ser 225-232). 
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NATURALLY OCCURRING 

As used herein "naturally occurring" refers to a PFI-007 with an amino acid sequence found in nature. 
ISOLATED/PURIFIED 

jffe used, herein;, the terms "isolated" and "purified? refer to molecules^ either fiucleic or amino acid : sequences; that are 
removed from their natural environment and isolated or separated "from at least one other component with which they are 
naturally associated. 

BIOLOGICALLY ACTIVE 



As used herein "biologically active" refers to a PFI-007 according to the present invention - such as a recombinant PFI- 
007 - having a similar structural function (but not necessarily to the same degree), and/or similar regulatory function (but 
not necessarily to the same degree), and/or similar biochemical function (but not necessarily to the same degree) and/or 
immunological activity (but not necessarily to the same degree) of the naturally occurring PFI-007. Specifically, a PFI-007 
45 of the present invention has the ability to act as a GPCR, which is one of the characteristic activities of the PFI-007 
polypeptide of the present invention. 

IMMUNOLOGICAL ACTIVITY 

^ As used herein, "immunological activity" is defined as the capability of the natural, recombinant or synthetic PFI-007 or 

any oligopeptide thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 



DERIVATIVE 

The term "derivative" as used herein in relation to the amino acid sequence includes chemical modification of a PFI-007. 
Illustrative of such modifications would be replacement of hydrogen by an alkyl, acyl, or amino group. 
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ANALOGUE 

The term "analogue" as used herein in relation to the amino acid sequence (or the coding sequence thereof) includes 
chemical modification of a PF1-007 or the coding sequence thereof. Illustrative of such modifications would be replacement 
of natural amino acid residues or natural nucleotides with non-natural amino acid residues (e.g. D-amino acids, beta- 
alanine, hydroxyproline) or non-natural nucleotides (e.g. inosine, demethyl-cytidine). 

DELETION 

As used herein a "deletion" is defined as a change in either nucleotide or amino acid sequence in which one or more 
nucleotides or amino acid residues, respectively, are absent. 

INSERTION/ADDITION 

As used herein an "insertion" or "addition" is a change in a nucleotide or amino acid sequence which has resulted in the 
addition of one or more nucleotides or amino acid residues, respectively, as compared to the naturally occurring PFI-007. 

SUBSTITUTION 

As used herein "substitution" results from the replacement of one or more nucleotides or amino acids by different 
?0 nucleotides or amino acids, respectively. 

HOMOLOGUE 

The term "homologue" with respect to the nucleotide sequence of the present invention and the amino acid sequence of 
25 the present invention may be synonymous with allelic variations of the sequences. 

In particular, the term "homology" as used herein may be equated with the term "identity". Here, sequence homology 
with respect to the nucleotide sequence of the present invention and the amino acid sequence of the present invention can 
be determined by a simple "eyeball" comparison (i.e. a strict comparison) of any one or more of the sequences with another 
sequence to see if that other sequence has at least 70% identity to the nucleotide and amino acid sequences. Relative 
30 sequence homology (i.e. sequence identity) can also be determined by commercially available computer programs that can 
calculate percentage (%) homology between two or more sequences. Typical examples of such computer programs are 
CLUSTAL or BLAST. 

Percentage (%) homology may be calculated over contiguous sequences, i.e. one sequence is aligned with the other 
sequence and each amino acid in one sequence directly compared .with the corresponding amino acid in the other 
35 sequence, one residue at a time. This is called ah "ungapped" alignment. Typically, such ungapped alignments are 
performed only over a relatively short number of residues (for example less than 50 contiguous amino acids). 

Although this is a very simple and consistent method, it fails to take into consideration that, for example, in an 
otherwise identical pair of sequences, one insertion or deletion will cause the following amino acid residues to be put out 
of alignment, thus potentially resulting in a large reduction in % homology when a global alignment is performed. 
40 Consequently, most sequence comparison methods are designed to produce optimal alignments that take into 
consideration possible insertions and deletions without penalising unduly the overall homology score. This is achieved by 
inserting "gaps" in the sequence alignment to try to maximise local homology. 

However, these more complex methods assign "gap penalties" to each gap that occurs in the alignment so that, for the 
same number of identical amino acids, a sequence alignment with as few gaps as possible - reflecting higher relatedness 
between the two compared sequences - will achieve a higher score than one with many gaps. "Affine gap costs" are 
typically used that charge a relatively high cost for the existence of a gap and a smaller penalty for each subsequent 
residue in the gap. -This is the most commonly used gap scoring system. High gap penalties will of course produce 
optimised alignments with fewer gaps. Most alignment programs allow the gap penalties to be modified. However, it is 
preferred to use the default values when .using such software for sequence comparisons. For example when using the 
GCG Wisconsin Bestfit package (see below) the default gap penalty for amino acid sequences is -12 for a gap and -4 for 
each extension. 

Calculation of maximum % homology therefore firstly requires the production of an optimal alignment, taking into 
consideration gap penalties. A suitable computer program for carrying out such an alignment is the GCG Wisconsin Bestfit 
package (University of Wisconsin, U.S.A.; Devereux et a/., 1984, Nucleic Acids Research 12:387). Examples of other 
software that can perform sequence comparisons include, but are not limited to, the BLAST package (see Ausubeler 
55 al, 1999 ibid - Chapter 18), FASTA (Altschul et a/., 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison 
tools. Both BLAST and FASTA are available for off-line and on-line searching (see Ausubel et a/., 1999 ibid, pages 7-58 to 
7-60). However, for some applications it is preferred to use the GCG Bestfit program. 
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Although the final % homology can be measured in terms of identity, in some cases, the alignment process rtsetf is 
typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity score matrix is generally used that 
assigns scores to each pairwise comparison based on chemical similarity or evolutionary distance. An example of such a 
matrix commonly used is the BLOSUM62 matrix - the default matrix for the BLAST suite of programs. GCG Wisconsin 

5 programs generally use either the public default values or a custom symbol comparison table if supplied (see user manual 
for further details). It is preferred to use the public default values for the GCG package, or in the case of other software, 
the default matrix, such as BLOSUM62. 

Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % sequence 
identity. The software typically does this as part of the sequence comparison and generates a numerical result. 

10 As indicated, for some applications, sequence homology (or identity) may be determined using any suitable homology 

algorithm, using for example default parameters. For a discussion of basic issues in similarity searching of sequence 
databases, see Altschul et al. (1994) Nature Genetics 6:1 19-129. For some applications, the BLAST algorithm is employed, 
with parameters set to default values. The BLAST algorithm is described in detail at http: 
//www.ncbi.nih.gov/BLAST/blast_help.html. Advantageously, "substantial homology", when assessed by BLAST, equates to 

15 sequences which match with an EXPECT value of at least about 7, preferably at least about 9 and most preferably 10 or 
more. The default threshold for EXPECT in BLAST searching is usually 10. 

Should Gap Penalties be used when determining sequence identity, then preferably the following parameters are used: 
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35 Other computer program methods to determine iden% arifr similarity between the two sequences include but are not 
limited to the GCG program package (Devereux et a/., 1984 Nucleic Acids Research 12: 387) and FAST A*(Ats6hul et al 
1990 J Molec Biol 403-410). 

POLYPEPTIDE VARIANTS AND DERIVATIVES 

40 

The terms "variant" or "derivative" in relation to the amino acid sequences of the present invention includes any 
substitution of, variation of, modification of, replacement of, deletion of or addition of one (or more) amino acids from or 
to the sequence providing the resultant amino acid sequence has PFI-007 activity, preferably having at least the same 
activity as the polypeptide presented in SEQ ID. NO: 2. 
45 The sequences of the present invention may be modified for use in the present invention. Typically, modifications are 

made that maintain the PFI-007 activity of the sequence. Amino acid substitutions may be made, for example from 1 , 2 or 3 
to 10, 20 or 30 substitutions provided that the modified sequence retains PFI-007 activity. Amino acid substitutions may 
include the use of non-naturally occurring analogues, for example to increase blood plasma half-life of a therapeutically 
administered polypeptide. 

Conservative substitutions may be made, for example according to the Table below. Amino acids in the same block in 
the second column and preferably in the same line in the third column may be substituted for each other: 
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As indicated above, proteins of the invention are typically made by recombinant means, for example as described herein, 
15 and/or by using synthetic means using techniques well known to the skilled person such as solid phase synthesis. Variants 
and derivatives of such sequences include fusion proteins, wherein the fusion proteins comprise at least the amino acid 
sequence of the present invention being linked (directly or indirectly) to another amino acid sequence. These other amino 
acid sequences - which are sometimes referred to as fusion protein partners - will typically impart a favourable 
functionality - such as to aid extraction and purification of the amino acid sequence of the present invention. Examples of 
fusion protein partners include glutathione-S-transferase (GST), 6xHis, GAL4 (DNA binding and/or transcriptional activation 
domains) and p-galactosidase. It may also be convenient to include a proteolytic cleavage site between the fusion protein 
partner and the protein sequence of the present invention so as to allow removal of the latter. Preferably the fusion protein 
partner will not hinder the function of the protein of the present invention. 
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POLYNUCLEOTIDE VARIANTS AND DERIVATIVES 



The terms "variant" or "derivative" in relation to the nucleotide sequence of the present invention include any 
substitution of, variation of, modification of, replacement of, deletion of or addition of one (or more) nucleic acid from or 
to the sequence providing the resultant nucleotide sequence codes for a polypeptide having PFI-007 activity, preferably 
having at least the same activity as the sequence presented in SEQ ID NO: 2. 

As indicated above, with respect to sequence homology, there is at least 70%, preferably at least 75%, more preferably 
at least 80%, more preferably at least 85%, more preferably at least 90%, more preferably at least 95% homology to the 
nucleotide sequence shown in SEQ ID NO: 1. Most preferably there is at least 98% homology to the nucleotide sequence 
shown in SEQ ID NO: 1 Nucleotide homology comparisons may be conducted as described, above. For some applications, 
a preferred sequence comparison, program is the G€G Wisconsin Bestfit program described above. The default scoring 
55 matrix has a match value pf 10 for each identical nucleotide and -9 for each mismatoh. The default gap creation penalty/ 
is -50 and the default gap extension penalty is -3 for each nucleotide. 

As used herein, the terms "variant", "homologue", "fragment" and "derivative" embrace allelic variations of the 
sequences. 

The term "variant" also encompasses sequences that are complementary to sequences that are capable of hydridising to 
40 the nucleotide sequences presented herein. 

HYBRIDISATION 

The term "hybridisation" as used herein shali include "the process by which a strand of nucleic acid joins with a 
45 complementary strand through base pairing" (Coombs J (1994) Dictionary of Biotechnology, Stockton Press, New York, NY, 
USA) as well as the process of amplification as carried out in PCR technologies as described in Dieffenbach CW and GS 
Dveksler (1995, PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, Plainview, NY, USA). 

Hybridisation conditions are based on the melting temperature (T m) of the nucleic acid binding complex, as taught in 
Berger and Kimmel (1987, Guide to Molecular Cloning Techniques, Methods in Enzymology, Vol 152, Academic Press, San 
50 Diego, CA, USA), and confer a defined "stringency" as explained below. 

Stringency of hybridisation refers to conditions under which pofynucleic acids hybrids are stable. Such conditions are 
evident to those of ordinary skill in the field. As known to those of skill in the art, the stability of hybrids is reflected 
in the melting temperature (Tm) of the hybrid which decreases approximately 1 to 1.5°C with every 1% decrease in 
sequence homology. In general, the stability of a hybrid is a function of sodium ion concentration and temperature. 
55 Typically, the hybridisation reaction is performed under conditions of higher stringency, followed by washes of varying 
stringency. 

As used herein, high stringency refers to conditions that permit hybridisation of only those nucleic acid sequences that 
form stable hybrids in 1 M Na + a at 65-68°C. 
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Maximum stringency typically occurs at about Tm-5*C (5*C below the Tm of the probe). 

High stringency occurs at about 5*C to 10*C below the Tm of the probe. High stringency conditions can be provided, for 
example, by hybridisation in an aqueous solution containing 6x SSC, 5x Denhardt's, 1% SDS (sodium dodecyl sulphate), 
0.1 Na + a pyrophosphate and 0.1 mg/ml denatured salmon sperm DNA as non-specific competitor. Following hybridisation, 
5 high stringency washing may be done in several steps, with a final wash (about 30 min) at the hybridisation temperature in 
0.2-0.1xSSC,0.1%SDS. 

Moderate, or intermediate, stringency typically occurs at about 10°C to 20°C below the Tm of the probe. 
Low stringency typically occurs at about 20*C to 25°C below the Tm of the probe. 

As will be understood by those of skill in the art, a maximum stringency hybridisation can be used to identify or detect 
f 0 identical polynucleotide sequences while an intermediate (or low) stringency hybridisation can be used to identify or detect 
similar or related polynucleotide sequences. 

Moderate stringency refers to conditions equivalent to hybridisation in the above-described solution but at about 60- 
62*C. In that case the final wash is performed at the hybridisation temperature in Ix SSC, 0.1% SDS. 

Low stringency refers to conditions equivalent to hybridisation in the above-described solution at about 50-52 w C. In that 
15 case, the final wash is performed at the hybridisation temperature in 2x SSC, 0.1 % SDS. 

It is understood that these conditions may be adapted and duplicated using a variety of buffers, e.g. formamide-based 
buffers, and temperatures. Denhardfs solution and SSC are well known to those of skill in the art as are other suitable 
hybridisation buffers (see, e.g., Sambrook, era/., eds. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, New York, NY, USA or Ausubel. et a/., eds. (1990) Current Protocols in Molecular Biology, John Wiley & 
20 Sons, Inc.). Optimal hybridisation conditions have to be determined empirically, as the length and the GC content of the 
probe also play a role. 

Polynucleotides of the invention capable of selectively hybridising to the nucleotide sequences presented herein, or to 
their complement, will be generally at least 70%, preferably at least 75%, more preferably at least 80%, more preferably at 
least 85%, more preferably at least 90%, more preferably at least 95%, most preferably at least 98% homologous to the 

25 corresponding nucleotide sequences presented herein over a region of at least 20, preferably at least 25 or 30, for instance 
at least 40, 60 or 100 or more contiguous nucleotides. 

The term "selectively hybridisable" means that the polynucleotide used as a probe is used under conditions where a 
target polynucleotide of the invention is found to hybridize to the probe at a level significantly above background. The 
background hybridization may occur because of other polynucleotides present, for example, in the cDNA or genomic DNA 

30 library being screened. In this event, background implies a level of signal generated by interaction between the probe and a 
non-specific DNA member of the library which is less than 10-fold, preferably less than 100-fold as intense as the specific 
interaction observed with the target DNA. The intensity of interaction may be measured, for example, by radiolabelling the 
probe, e.g. with 3 %P. 

In a preferred aspect, th6 present invention covers nucleotide sequences that can .hybridise to any one or more of the 
35 nucleotide sequences of the present invention under stringent conditions (e.g. 65*C and O.lxSSC {1xSSC = 0.15 M NaCI, 
0.01S M Na 3 Citrate pH 7.0}). 

Where the polynucleotide of the present invention is double-stranded, both strands of the duplex, either individually or in 
combination, are encompassed by the present invention. Where the polynucleotide is single-stranded, it is to be 
understood that the complementary sequence of that polynucleotide is also included within the scope of the present 
40 invention. 

Polynucleotides which are not 100% homologous to the sequences of the present invention but fall within the scope of 
the invention can be obtained in a number of ways. Other variants of the sequences described herein may be obtained, for 
example, by probing DNA libraries made from a range of individuals, for example individuals from different populations. In 
addition, other viral/bacterial, or cellular homologues particularly cellular homologues found in mammalian cells (e.g. 

45 bovine, ovine, porcine, equine, rodent and primate cells), may be obtained and such homologues and fragments thereof in 
general will be capable of selectively hybridising to the sequence shown in SEQ ID NO: 1 . Such sequences may be 
obtained by probing cDNA libraries made from, or genomic DNA libraries derived from, other animal species, and probing 
such libraries with probes comprising all or part of the sequence shown in SEQ ID NO: 1 under conditions of medium to 
high stringency. Similar considerations apply to obtaining species homologues and allelic variants of the polypeptide or 

^ nucleotide sequences of the invention. 

Variants and strain/species homologues may also be obtained using degenerate PCR which will use primers designed to 
target sequences within the variants and homologues encoding conserved amino acid sequences within the sequences of 
the present invention. Conserved sequences can be predicted, for example, by aligning the amino acid sequences from 
several variants/homologues. Sequence alignments can be performed using computer software known in the art. For 
example the GCG Wisconsin PileUp program is widely used. 

The primers used in degenerate PCR will contain one or more degenerate positions and will be used at stringency 
conditions lower than those used for cloning sequences with single sequence primers against known sequences. 

Alternatively, such polynucleotides may be obtained by site-directed mutagenesis of characterised sequences. This may 
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be useful where, for example, silent codon changes are required to sequences to optimise codon preferences for a 
particular host cell in which the polynucleotide sequences are being expressed. Other sequence changes may be desired 
in order to introduce restriction enzyme recognition sites, or to after the property or function of the polypeptides encoded 
by the polynucleotides. 

5 Polynucleotides of the invention may be used to produce a primer, e.g. a PGR primer, a primer for an alternative 

amplification reaction, a probe e.g. labelled with a revealing label by conventional means using radioactive or non- 
radioactive labels, or the polynucleotides may be cloned into vectors. Such primers, probes and other fragments will be at 
least 15, preferably at least 20, for example at least 25, 30 or 40 nucleotides in length, and are also encompassed by the 
term polynucleotides of the invention as used herein. 

10 Polynucleotides such as DNA polynucleotides and probes according to the invention may be produced recombinantly, 

synthetically, or by any means available to those of skill in the art. They may also be cloned by standard techniques. 

In general, primers will be produced by synthetic means, involving a step wise manufacture of the desired nucleic acid 
sequence one nucleotide at a time. Techniques for accomplishing this using automated techniques are readily available in 
the art. 

15 Longer polynucleotides will generally be produced using recombinant means, for example using PCR cloning 

techniques. This will involve making a pair of primers (e.g. of about 15 to 30 nucleotides) flanking a region of the sequence 
which it is desired to clone, bringing the primers into contact with mRNA or cDNA obtained from an animal or human cell, 
performing a polymerase chain reaction under conditions which bring about amplification of the desired region, isolating the 
amplified fragment (e.g. by purifying the reaction mixture on an agarose gel) and recovering the amplified DNA. The 

2Q primers may be designed to contain suitable restriction enzyme recognition sites so that the amplified DNA can be cloned 
into a suitable cloning vector. 

REGULATORY SEQUENCES 
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Preferably, the polynucleotide of the present invention is operably linked to a regulatory sequence which is capable of 
providing for the expression of the coding sequence, such as by the chosen host cell. By way of example, the present 
invention covers a vector comprising the polynucleotide of the present invention operably linked to such a regulatory 
sequence, i.e. the vector is an expression vector. 

The term "operably linked" refers to a juxtaposition wherein the components described are in a relationship permitting 
them to function in their intended manner. A regulatory sequence "operably linked" to a coding sequence is ligated in such 
a way that expression of the coding sequence is achieved under condition compatible with the control sequences. 

The term "regulatory sequences" includes promoters and enhancers and other expression regulation signals. 

The term "promoter" is used in the normal sense of the art, e.g. an RNA polymerase binding site. 

Enhanced expression of the polynucleotide encoding the polypeptide of the present invention may also be achieved by 
the selection of heterologous regulatory regions, e.g. promoter, secretion leader and terminator regions, which, serve to 
increase exp'ressidh' and, if desired, secretionievelsof the prdteih of interest from the chosen expression host asd/or «h 
provide for the inducible control of the expression of the polypeptide of the present invention. 

Preferably, the nucleotide sequence of the present invention may be operably linked to at least a promoter. 
. Aside from the promoter native to the gene encoding the polypeptide of the present invention, other promoters may be 
used to direct expression of the polypeptide of the present invention. The promoter may be selected for its efficiency in 
directing the expression of the polypeptide of the present invention in the desired expression host. 

' In another embodiment, a constitutive promoter may be selected to direct the expression of the desired polypeptide of 
the present invention. Such an expression construct may provide additional advantages since it circumvents the need to 
culture the expression hosts on a-medium containing an inducing substrate. 

Examples of suitable promoters would be LTR, SV40 and CMV in mammalian systems; E. coli lac or trp in bacterial 
systems; baculovirus polyhedron promoter (polh) in insect systems and other promoters that are known to control 
expression in eukaryotic and prokaryotic cells or their viruses. 

Examples of strong constitutive and/or inducible promoters which are preferred for use in fungal expression hosts are 
those which are obtainable from the fungal genes for xylanase (x//?A), phytase, ATP-synthetase, subunit 9 (o//C), those 
phosphate isomerase (tpi), alcohol dehydrogenase (AdhA), a -amylase (amy), amyloglucosidase (AG - from the gfaA gene), 
acetamidase (amdS) and glyceraldehyde-3-phosphate dehydrogenase (gpd) promoters. 

Examples of strong yeast promoters are those obtainable from the genes for alcohol dehydrogenase, lactase, 3- 
phosphoglycerate kinase and triosephosphate isomerase. 

Examples of strong bacterial promoters are the a-amylase and SP02 promoters as well as promoters from extracellular 
protease genes. 

Hybrid promoters may also be used to improve inducible regulation of the expression construct. 
The promoter can additionally include features to ensure or to increase expression in a suitable host. For example, the 
features can be conserved regions such as a Pribnow Box or a TATA box. The promoter may even contain other sequences 
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to affect (such as to maintain, enhance or decrease) the levels of expression of the nucleotide sequence of the present 
invention. For example, suitable other sequences include the Sh1-intron or an ADH intron. Other sequences include 
inducible elements - such as temperature, chemical, light or stress inducible elements. Also, suitable elements to enhance 
transcription or translation may be present. An example of the latter element is the TMV 5' signal sequence (see Sleat, 

5 Gene 21 7 [1987] 21 7-225; and Dawson, Plant Mol. Biol. 23 [1 993] 97). 

The expression vector may also contain sequences which act on the promoter to amplify expression. For example, the 
SV40, CMV t and polyoma cis-acting elements (enhancer) and a selectable marker can provide a phenotypic trait for 
selection (e.g. dihydrofolate reductase or neomycin resistance for mammalian cells or amplicillin/tetracyclin resistance for 
E. coti). Selection of the appropriate vector containing the appropriate promoter and selection marker is well within the level 

1 q of those skilled in the art. 

CONSTRUCTS 

The term "construct" - which is synonymous with terms such as "conjugate", "cassette" and "hybrid" - includes the 
15 nucleotide sequence according to the present invention directly or indirectly attached to a promoter. An example of an 
indirect attachment is the provision of a suitable spacer group such as an intron sequence, such as the Sh1 -intron or the 
ADH intron, intermediate the promoter and the nucleotide sequence of the present invention. The same is true for the term 
"fused" in relation to the present invention which includes direct or indirect attachment. In each case, the terms do not 
cover the natural combination of the nucleotide sequence coding for the protein ordinarily associated with the wild-type 
gene promoter and when they are both in their natural environment. 

20 

The construct may even contain or express a marker which allows for the selection of the genetic construct in, for 
example, mammalian, yeast, insect, fungal or bacterial cells. Various markers exist which may be used, such as, for 
example, those that provide for antibiotic resistance to G418, hygromycin, bleomycin, kanamycin and gentamycin. 

Preferably the construct of the present invention comprises at least the nucleotide sequence of the present invention 
operably linked to a promoter. 

25 

VECTORS 

The term "vector" includes expression vectors and transformation vectors and shuttle vectors. 
The term "expression vector" means a construct capable of in vivc or in vitro expression. 
30 The term "transformation vector" means a construct capable of being transferred from one entity to another entity - which 
may be of the same species or may be of a different species. If the construct is capable of being transferred from one 
species to another - such as from a viral vector such as MMLV or FIV to a human or mammalian primary cell or cell line, 
then the transformation vector is sometimes referred to as a "shuttle vector?. 

A large variety of expression systems may be used in different hosts. For. example, episomal, chromosomal and virus- 
^ derived systems (eig. vectors derive^ from bacterial plasmids, bacteriophage, papova virus such as SV40, vaccinia virus, 
adenovirus; and retrovirus). 

The DNA sequence can be inserted into the vector by a variety of techniques. In general the DNA sequence is inserted 
into an appropriate restriction endonuclease site by procedures known in the art and deemed to be within the scope of 
those skilled in the art. The DNA sequence in the expression vector is linked operativefy to appropriate control sequences 
40 that direct mRNA synthesis (i.e the promoter). 

The vectors of the present invention may be transformed into a suitable host cell as described below to provide for 
expression of a polypeptide of the present invention. Thus, in a further aspect, the invention provides a process for 
preparing polypeptides according to the present invention which comprises cultivating a host cell transformed or 
transfected with an expression vector as described above under conditions to provide for expression by the vector of a 
45 coding sequence encoding the polypeptides, and recovering the expressed polypeptides. 

The vectors may be, for example, plasmid, virus or bacteriophage (phage) vectors provided with an origin of replication, 
optionally a promoter for the expression of the polynucleotide and optionally a regulator of the promoter. 

The vectors of the present invention may contain one or more selectable marker genes. The most suitable selection 
systems for industrial micro-organisms are those formed by the group of selection markers which do not require a mutation 
50 in the host organism. Examples of fungal selection markers are the genes for acetamidase (amdS), ATP synthetase, 
subunit 9 (oliC), orotidine-5 , -phosphate-decarboxylase (pvrA), phleomycin and benomyl resistance (benA). Examples of 
non-fungal selection markers are the bacterial G418 resistance gene (this may also be used in mammalian cells, yeast, but 
not in filamentous fungi), the ampicillin resistance gene (£ coti), the neomycin resistance gene (mammalian cells) and 
the E. coli uidA gene, coding for ^-glucuronidase (GUS). 
55 Vectors may be used in vitro, for example for the production of RNA or used to transfect or transform a host cell. 

Thus, polynucleotides of the present invention can be incorporated into a recombinant vector (typically a replicable 
vector), for example a cloning or expression vector. The vector may be used to replicate the nucleic acid in a compatible 
host cell. Thus, in a further embodiment, the invention provides a method of making polynucleotides of the present 
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invention by introducing a polynucleotide of the present invention into a replicable vector, introducing the vector into a 
compatible host cell, and growing the host ceil under conditions which bring about replication of the vector. The vector may 
be recovered from the host cell. Suitable host cells are described below in connection with expression vectors. 

The present invention also relates to the use of genetically engineered host cells expressing a PFI-007 polypeptide or 

5 variant, homologue, fragment, analogue or derivative thereof in screening methods for the identification of modulators (e.g. 
antagonists or agonists) of PFI-007. Such genetically engineered host cells could be used to screen peptide libraries or 
organic molecules capable of modulating PFI-007 activity. Modulators (e.g. antagonists) of the PFI-007 polypeptide, such 
as antibodies, peptides or small organic molecules will provide the basis for pharmaceutical compositions for the treatment 
of diseases associated with, for example, PFI-007. Such modulators (e.g. antagonists) can be administered alone or in 

10 combination with other therapeutics for the treatment of such diseases. 

The present invention also relates to expression vectors and host cells comprising polynucleotide sequences encoding 
PFI-007 or a variant, homologue, fragment, analogue or derivative thereof for the in vivo or in vitro production of PFI-007 
protein or to screen for agents that can affect PFI-007 expression or activity. 

15 TISSUE 

The term "tissue" as used herein includes tissue perse and organ. 
HOST CELLS 

20 The term "host cell" - in relation to the present invention - includes any cell that could comprise the nucleotide sequence 
coding for the recombinant protein according to the present invention and/or products obtained therefrom, wherein a 
promoter can allow expression of the nucleotide sequence according to the present invention when present in the host 
cell. 

Thus, a further embodiment of the present invention provides host cells transformed or transfected with a polynucleotide 

25 of the present invention. Preferably said polynucleotide is carried in a vector for the replication and expression of said 
polynucleotide. The cells will be chosen to be compatible with the said vector and may, for example, be prokaryotic (for 
example, bacterial cells), or eukaryotic (i.e. mammalian, fungal, insect and yeast cells). 

Introduction of polynucleotides into host cells can be effected by methods as described in Sambrook, et al, eds. (1989) 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York, NY, USA. These methods 

30 include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated transfection, cationic lipid-mediated 
transfection, electroporation, transvection, microinjection, transduction, scrape loading, and ballistic introduction. 

Examples of representative hosts include, bacterial cells (e.g £. coli, Steptomyces); fungal cells such as yeast cells 
arid Aspergillus; insect cells such as Drosophila S2 and Spodoptera SF9 cells; animal cells such as CHO, COS, HEK, 
HeLa, and 3T3 cells. The selection of the appropriate host is deemed to be within the scope of those skilled in the art. 

35 .{^eliding on the natar*of*he polynucleotide encoding meljotypejrfide of tfie present invention, and/or the desirability 
for farther processing of the expressed protein, eukaryotic hosts such as yeasts or other fungi may be preferred. In general, 
yeast cells are preferred over fungal cells because they are easier to manipulate. However, some proteins are either poorly 
expressed or secreted from the yeast cell, or in some cases are not processed property (e.g. hypergtycosylation in yeast). 
In these instances, a different fungal host organism should be selected. 

AO Examples of suitable expression hosts within the scope of the present invention are fungi such as Aspergillus species 

(such as those described in EP-A-01 84438 and EP-A-0284603) andTrichoderma , species; bacteria such 
as Escherichia species, Streptomyces species andPseudomonas species; and yeasts such as Kiuyveromyces species 
(such as those described in EP-A-0096430 and EP-A-0301670) and Saccharomyces species. By way of example, typical 
expression hosts may be selected from Aspergillus niger, Aspergillus niger var. tubigenis, Aspergillus niger var. awamoh, 

45 Aspergillus aculeatis, Aspergillus nidulans, Aspergillus orvzae, Trichoderma reesei, Kiuyveromyces lactis, 
Schizosaccharomyces pombe, Pichia pastons andSaccharomyces cerevisiae. 

The use of suitable host cells - such as mammalian, yeast, insect and fungal host cells - may provide for post- 
translational modifications (e.g. myristoylation, gtycosylation, truncation, lapidation and tyrosine, serine or threonine 
phosphorylation) as may be needed to confer optimal biological activity on recombinant expression products of the present 

50 invention. 

ORGANISM 

The term "organism" in relation to the present invention includes any organism, except man, that could comprise the 
nucleotide sequence coding for the recombinant protein according to the present invention and/or products obtained 
55 therefrom, wherein a promoter can allow expression of the nucleotide sequence according to the present invention when 
present in the organism. Examples of organisms may include a fungus, yeast or protozoan. 

The term "transgenic organism" in relation to the present invention includes any organism, except man, that comprises 
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the nucleotide sequence coding for the protein according to the present invention and/or products obtained therefrom, 
wherein the promoter can allow expression of the nucleotide sequence according to the present invention within the 
organism. Preferably the nucleotide sequence is incorporated in the genome of the organism. 

The term "transgenic organism" does not cover the native nucleotide coding sequence according to the present 
invention in its natural environment when it is under the control of its native promoter which is also in its natural 
environment. In addition, the present invention does not cover the native protein according to the present invention when it 
is in its natural environment and when it has been expressed by its native nucleotide coding sequence which is also in its 
natural environment and when that nucleotide sequence is under the control of its native promoter which is also in its 
natural environment. 

Therefore, the transgenic organism of the present invention includes an organism comprising any one of, or 
combinations of, the nucleotide sequence coding for the amino acid sequence according to the present invention, 
constructs according to the present invention (including combinations thereof), vectors according to the present invention, 
plasmids according to the present invention, cells according to the present invention, and tissues according to the present 
invention or the products thereof. The transformed cell or organism could prepare acceptable quantities of the desired 
compound which would be easily retrievable from the cell or organism. 

TRANSFORMATION OF HOST CELLS/HOST ORGANISMS 

As indicated earlier, the host organism can be a prokaryotic or a eukaryotic organism. An example of a suitable 
prokaryotic host is E. coti. Teachings on the transformation of prokaryotic hosts are well documented in the art, for example 
see Sambrook et at. (Molecular Cloning: A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory Press, 
New York, NY, USA) and Ausubel et at. (Current Protocols in Molecular Biology (1995), John Wiley & Sons, Inc.). 

In a preferred embodiment, the transformed host is a mammalian cell or, for example, an insect cell, wherein 
introduction of polynucleotides into said host cells can be effected by methods as described in, for example, Sambrook et 
at. (Molecular Cloning: A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory Press, New York, NY, USA). 
These methods include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated transfection, 
cationic lipid-mediated transfection, electroporation, transvection, microinjection, transduction, scrape loading, and 
ballistic introduction. 

In another embodiment the transgenic organism can be a yeast. In this regard, yeast have also been widely used as a 
vehicle for heterologous gene expression. The species Saccharomyces cerevisiae has a long history of industrial use, 
including its use for heterologous gene expression. Expression of heterologous genes in Saccharomyces cerevisiae has 
been reviewed by Goodey et at (1987, Yeast Biotechnology, D R Berry et al, eds, pp 401-429, Allen and Unwin, London) 
and by King et al. (1989, Molecular and Cell Biology of Yeasts, E F Walton and G T Yarronton, eds, pp 107-133, Blackie, 
Glasgow). 

For several reasons Saccharomyces cerevisiae is well suited for heterologous gene expression. First, it is non- 
<jpamogenic to humans and it 19 incapable^of producing certain endotoxins. S^coffd, it has a long history of safe use 
following centuries cf commercial exploitation for various purposes. This haVletffo wide jsiAlic acceptabBity. Thjfrd, the 
extensive commercial use and research devoted to the organism has resulted in a wealth of knowledge about the genetics 
and physiology as well as targe-scale fermentation characteristics of Saccharomyces cerevisiae. 

A review of the principles of heterologous gene expression in Saccharomyces cerevisiae and secretion of gene products 
is given by E Hinchcliffe and E Kenny ("Yeast as a vehicle for the expression of heterologous genes", 1993, Yeasts, Vol 5, 
Anthony H Rose and J Stuart Harrison, eds, 2nd edition, Academic Press Ltd.). 

Several types of yeast vectors are available, including integrative vectors, which require recombination with the host 
genome for their maintenance, and autonomously replicating plasmid vectors. 

In order to prepare the transgenic Saccharomyces, expression constructs are prepared by inserting the nucleotide 
sequence of the present invention into a construct designed for expression in yeast. Several types of constructs used for 
heterologous expression have been developed. The constructs contain a promoter active in yeast fused to the nucleotide 
sequence of the present invention, usually a promoter of yeast origin, such as the GAL1 promoter, is used. Usually a signal 
sequence of yeast origin, such as the sequence encoding the SUC2 signal peptide, is used. A terminator active in yeast 
ends the expression system. 

For the transformation of yeast several transformation protocols have been developed. For example, a 
transgenic Saccharomyces according to the present invention can be prepared by following the teachings of Hinnen et 
at. (1 978, Proceedings of the National Academy of Sciences of the USA, 75: 1 929); Beggs, J D (1 978, Nature, London, 275: 
104); and Ito, H et at. (1983, J. Bacteriology 153:163-168). 

The transformed yeast cells are selected using various selective markers. Among the markers used for transformation 
are a number of auxotrophic markers such as LEU2, HIS4 and TRP1 , and dominant antibiotic resistance markers such as 
aminoglycoside antibiotic markers, e.g. G418. 

Thus, the present invention also provides a method of transforming a host cell with a nucleotide sequence shown in 
SEQ ID NO: 1 or a derivative, homologue, variant, analogue or fragment thereof. 
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Host cells transformed with a PFI-007 nucleotide coding sequence may be cultured under conditions suitable for the 
expression and recovery of the encoded protein (in cell membranes) from cell culture. The protein produced by a 
recombinant cell may be expressed on the cell surface, secreted or may be contained intracellulariy depending on the 
sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing PFI-007 
5 coding sequences will generally enable expression in the cell membrane. Other recombinant constructions may join PFI- 
007 coding sequence to nucleotide sequence encoding a polypeptide domain which will facilitate protein 
purification/identification (Kroll DJ et at. (1993) DNA Cell Biol Vol 12 p441-53, see also above discussion of vectors 
containing fusion proteins). 

1Q PRODUCTION OF THE POLYPEPTIDE 

According to the present invention, the production of the polypeptide of the present invention can be effected by the 
cuituring of eukaryotic or prokaryotic expression hosts, which have been transformed with one or more polynucleotides of 
the present invention, in a conventional nutrient fermentation medium. The selection of the appropriate medium may be 
based on the choice of expression hosts and/or based on the regulatory requirements of the expression construct. Such 
media are well-known to those skilled in the art. The medium may, if desired, contain additional components favouring the 
transformed expression hosts over other potentially contaminating micro-organisms. 

Thus, the present invention also provides a method for producing a polypeptide having PFI-007 activity, the method 
comprising the steps of (a) transforming a host cell with a nucleotide sequence shown in SEQ ID NO: 1 or a derivative, 
homologue, variant, analogue or fragment thereof; and (b) cuituring the transformed host cell under conditions suitable for 
20 the expression of said polypeptide. 

The present invention also relates to a method for producing a polypeptide having PFI-007 activity, the method 
^ comprising the steps of (a) cuituring a host cell that has been transformed with a nucleotide sequence shown in SEQ ID 
NO: 1 or a derivative, homologue, variant, analogue or fragment thereof under conditions suitable for the expression of said 
polypeptide; and (b) recovering said polypeptide from the host cell culture. 
25 The present invention also relates to a method for producing a polypeptide having PFI-007 activity, the method 

comprising the steps of (a) transforming a host cell with a nucleotide sequence shown in SEQ ID NO: 1 or a derivative, 
homologue, variant, analogue or fragment thereof; (b) cuituring the transformed host cell under conditions suitable for the 
expression of said polypeptide; and (c) recovering said polypeptide from the host cell culture. 

30 RIBOZYMES 

Ribozymes are enzymatic RNA molecules capable of catalysing the specific cleavage of RNA. The mechanism of 
ribozyme action involves sequence specific hybridisation of the ribozyme molecule to complementary target RNA, followed 
by endonudeolytic cleavage. Within the scope of the invention are engineered hammerhead motif ribozyme molecules that 

35 specifically and efficiency catalyse endonucieorytic cleavage, of PF£007 RNA sequences. 

Specific ribozyme cleavage sites withTft any potential RNA target are 1 initially icfcsritified by scanning the tarjgetmbtecute 
for ribozyme cleavage sites which include the following sequences: GUA, GUU and GUC. Once identified, short RNA 
sequences of between 15 and 20 ribonucleotides corresponding to the region of the target gene containing the cleavage 
site may be evaluated for secondary structural features which may render the oligonucleotide sequence inoperable. The 

40 suitability of candidate targets may also be evaluated by testing accessibility to hybridisation with complementary 
oligonucleotides using ribonuclease protection assays. 

Both antisense RNA and DNA molecules and ribozymes of the invention may be prepared by any method known in the 
art for the synthesis of RNA molecules. These include techniques for chemically syntnesising oligonucleotides such as 
solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro or in 

4$ vivo transcription of DNA sequences, encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, antisense cDNA 
constructs that synthesize antisense RNA constitutively or inducibly can be introduced into cell lines, celts or tissues. 



50 



DETECTION 



The presence of the PFI-007 polynucleotide coding sequence can be detected by DNA-DNA or DNA-RNA hybridisation 
or amplification using probes, portions or fragments of the sequence presented in SEQ ID NO: 1 . Nucleic acid amplification- 
based assays involve the use of oligonucleotides or oligomers based on the PFI-007 coding sequence to detect 
transformants containing PFI-007 DNA or RNA. As used herein "oligonucleotides'' or "oligomers" may refer to a nucleic acid 
sequence of at least about 10 nucleotides and as many as about 60 nucleotides, preferably about 15 to 30 nucleotides, 
55 and more preferably about 20 to 25 nucleotides which can be used as a probe or amplimer. Preferably, oligonucleotides 
are derived from the 3' region of the nucleotide sequence shown in SEQ ID NO: 1 . 

A variety of protocols for detecting and measuring the expression of PFI-007 polypeptide, such as by using either 
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polyclonal or monoclonal antibodies specific for the protein, are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA) and fluorescent activated cell sorting (FACS). A two-site, 
monoclonal-based immunoassay utilising monoclonal antibodies reactive to two non-interfering epitopes on a PFI-007 
polypeptide is preferred, but a competitive binding assay may also be employed. These and other assays are described, 

5 among other places, in Hampton R et al (1990, Serological Methods, A Laboratory Manual, APS Press. St Paul, MN, USA) 
and Maddox DE et al. (1983, J Exp Med 15 8:1211). 

A wide variety of labels and conjugation techniques are known by those skilled in the art and can be used in various 
nucleic and amino acid assays. Means for producing labelled hybridisation or PCR probes for detecting PFI-007 
polynucleotide sequences include oligolabelling, nick translation, end-labelling or PCR amplification using a labelled 

70 nucleotide. Alternatively, the PFI-007 coding sequence, or any portion of it, may be cloned into a vector for the production 
of an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3 or SP6 and labelled nucleotides. 

A number of companies such as Pharmacia Biotech (Piscataway, NJ, USA), Promega (Madison, Wl, USA), and US 
Biochemical Corporation (Cleveland, OH, USA) supply commercial kits and protocols for these procedures. Suitable 

15 reporter molecules or labels include those radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents 
as well as substrates, cofactors, inhibitors, magnetic particles and the like. Patents teaching the use of such labels 
include US-A-3817837; US-A-3850752; US-A-3939350; US-A-3996345; US-A-4277437; US-A-4275149 and US-A-4366241. 
Also, recombinant immunoglobulins may be produced as shown in US-A-4816567. 

Additional methods to quantify the expression of a particular molecule include radiolabelling (Melby PC et at., 1993, J. 

2Q Immunol Methods Vol 159 p235-44) or biotinylating (Duplaa C et al, 1993, Anal Biochem Vol 229 p36) nucleotides, co- 
amplification of a control nucleic acid, and standard curves onto which the experimental results are interpolated. 
Quantification of multiple samples may be speeded up by running the assay in an ELISA format where the oligomer of 
interest is presented in various dilutions and a spectrophotometry or calorimetric response gives rapid quantification. 

Although the presence/absence of marker gene expression suggests that the gene of interest is also present, its 
presence and expression should be confirmed. For example, if the PFI-007 coding sequence is inserted within a marker 
gene sequence, recombinant cells containing PFI-007 coding regions can be identified by the absence of marker gene 
function. Alternatively, a marker gene can be placed in tandem with a PFI-007 coding sequence under the control of a 
single promoter. Expression of the marker gene in response to induction or selection usually indicates expression of PFI- 
007 as well. 

Alternatively, host cells which contain the coding sequence for PFI-007 and express PFI-007 coding regions may be 
30 identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, 
DNA-DNA or DNA-RNA hybridisation and protein bioassay or immunoassay techniques which include membrane-based, 
solution-based, or chip-based technologies for the detection and/or quantification of the nucleic acid or protein. 

ANTIBODIES 

35 

The amino acid sequence of the present invention can also he used to generate antibodies ^ such as by use of standard 
techniques - against the amino acid sequence. 

Procedures well known in the art may be used for the production of antibodies to PFI-007 polypeptides. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab fragments and fragments produced by a 

40 Fab expression library. Neutralising antibodies, i.e. those which antagonise biological activity of PFI-007 polypeptides, are 
especially preferred for diagnostics and therapeutics. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, etc. may be immunised by injection 
with the PFI-007 polypeptide or any portion, variant, homologue. fragment, analogue or derivative thereof or oligopeptide 
which retains immunogenic properties. Depending on the host species, various adjuvants may be used to increase 

45 immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels such as aluminium 
hydroxide, and surface active substances such as lysolecithin, pluronic potyols, potyanions, peptides, oil emulsions, 
keyhole limpet hemocyanin, and dinitrophenol. BCG(Bacifli Calmette-Guerin) and Corynebacterium parvum are potentially 
useful human adjuvants which may be employed. 

Monoclonal antibodies to the amino acid sequence may be prepared using any technique which provides for the 

50 production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma 
technique originally described by Koehler and Milstein (1975, Nature Vol 256 p495-497), the human B^celJ hybridoma 
technique (Kosbor et al (1983) Immunol Today Vol 4 p72; Cote et al. (1983) Proceedings of the National Academy of 
Sciences (USA) Vol 80 p2026-2030) and the EBV-hybridoma technique (Cole et al. (1985) Monoclonal Antibodies and 
Cancer Therapy, Alan R Liss Inc. pp. 77-96). In addition, techniques developed for the production of "chimeric antibodies", 

55 the splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate antigen specificity 
and biological activity can be used (Morrison et at. (1984) Proceedings of the National Academy of Sciences (USA> Vol 81 
□6851-6855; Neuberger et al. (1984) Nature Vol 312 p604-608; Takeda et al (1985) Nature Vol 314 p452-454). Alternatively, 
techniques described for the production of single chain antibodies (US-A-4946779) can be adapted to produce polypeptide- 



-20- 



EP 1 092 727 A2 

specific single chain antibodies. 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population or by screening 
recombinant immunoglobulin libraries or panels of highly specific binding reagents as disclosed in Orlandi et ai (1989, 
Proceedings of the National Academy of Sciences (USA) Vol 86 p 3833-3837), and Winter G and Milstein C (1991 ; Nature 
5 Vol 349 p293-299). 

Antibody fragments which contain specific binding sites for PFI-007 may also be generated. For example, such 
fragments include, but are not limited to, the F(ab*) 2 fragments which can be produced by pepsin digestion of the antibody 
molecule and the Fab fragments which can be generated by reducing the disulphide bridges of the F(ab") 2 fragments. 
Alternatively, Fab expression libraries may be constructed to allow rapid and easy identification of monoclonal Fab 
10 fragments with the desired specificity (Huse WD et ai (1989) Science Vol 256 p1275-1281). 

An alternative technique involves screening phage display libraries where, for example the phage express scFv 
fragments on the surface of their coat with a large variety of complementarity determining regions (CDRs). This technique is 
well known in the art. 

PFI-007-specific antibodies are useful for the diagnosis of conditions and diseases associated with expression of the PFI- 
15 007 receptor. A variety of protocols for competitive binding or immunoradiometric assays using either polyclonal or 
monoclonal antibodies with established specificities are well known in the art. Such immunoassays typically involve the 
formation of complexes between PFI-007 polypeptide and its specific antibody (or similar PFI-007-binding molecule) and 
the measurement of complex formation. A two-site, monoclonal based immunoassay utilising monoclonal antibodies 
reactive to two, non-interfering epitopes on a specific PFI-007 protein is preferred, but a competitive binding assay may 
20 also be employed. These assays are described in Maddox DE et ai (1983, Journal of Experimental Medicine Vol 158 
p1211). 

Anti-PFI-007 antibodies are useful for the diagnosis of disorders involving abnormal signal transduction or other 
disorders or diseases characterised by abnormal expression of a PFI-007 receptor. Diagnostic assays for PFI-007 include 
methods utilising the antibody and a label to detect a PFI-007 polypeptide in human body fluids, cells, tissues or sections 
or extracts of such tissues. The polypeptides and antibodies of the present invention may be used with or without 
25 modification. Frequently, the polypeptides and antibodies will be labelled by joining them, either covalently or 
noncovalently, with a reporter molecule. A wide variety of reporter molecules are known to those of skill in the art. 

Antibodies may be used in method of detecting polypeptides of the invention present in biological samples by a method 
which comprises: (a) providing an antibody of the invention; (b) incubating a biological sample with said antibody under 
conditions which allow for the formation of an antibody-antigen complex; and (c) determining whether antibody-antigen 
complex comprising said antibody is formed. 

Antibodies of the invention may be bound to a solid support and/or packaged into kits in a suitable container along with 
suitable reagents, controls, instructions and the like. 

ASSAYS/IDENTIFICATION METHQDS . 

the present invention also relates to an assay method for detecting the presence of PFI-007 in cells (such as human 
cells) comprising: (a) performing a reverse transcriptase-polymerase chain reaction (RT-PCR) on RNA (such as total RNA) 
from such cells using a pair of PCR primers that are specific for PFI-007, as determined from the DNA sequence shown in 
SEQ ID NO: 1 or an allelic variation thereof; and (b) assaying the appearance of an appropriately sized PCR fragment - 
40 such as by agarose gel electrophoresis. 

There are numerous assays in which the polypeptide of the present invention can used to screen for modulators (e.g. 
antagonists or agonists) of the polypeptide. Examples of such assays include: 

Functional Assay - One example of a method for screening receptors to identify antagonists thereof is to monitor the 
inhibitory or stimulatory effect on cAMP or, adenylate cyclase accumulation. Such an assay involves transfecting a 
45 mammalian cell with the receptor of the present invention for cell surface expression. The cell is then exposed to putative 
antagonists and the amount of cAMP accumulation is measured. If the putative antagonist binds the receptor the levels of 
receptor-mediated cAMP or adenylate cyclase activity will either increase ordecrease. 

Functional Assay using a Fluorometric Imaging Plate Reader (FlipR).- A* technique used for screening and includes the 
use of cells that express the receptor of the present invention (for example, transfected HEK293 cells) in a system that 
50 measures intracellular calcium or extracellular pH changes caused by receptor activation. In this technique, cells 
expressing the receptor of the invention may be contacted with compounds (e.g. small molecules, peptides, lipids, 
nucleotides or glycoproteins) that cause a second messenger response, e.g. signal transduction, change in calcium levels 
or pH changes. These changes are used to determine whether the potential compound activates or inhibits the receptor. 
Ligand Binding Assay - This type of assay may test binding of a candidate compound, where adherence to the cells 
55 containing the receptor of the present invention is detected by means of a label directly or indirectly associated with the 
candidate compound or in an assay involving competition with a labelled competitor. Standard assays for conducting 
screens that determine if the compound activates or inhibits the receptor are well understood by those skilled in the art 
The present invention therefore also relates to a method of identifying agents (such as compounds, other substances or 
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compositions comprising the same) that affect (such as antagonise, agonise or otherwise modify) the activity of PFI-007 
and/or the expression thereof, the method comprising contacting PFI-007 or the nucleotide sequence coding for the same 
with the agent and then measuring the activity of PFI-007 and/or the expression thereof. 

The present invention also relates to a method of identifying agents (such as compounds, other substances or 

5 compositions comprising the same) that selectively affect (such as antagonise, agonise or otherwise modify) the activity of 
PFI-007 and/or the expression thereof, the method comprising contacting PFI-O07 or the nucleotide sequence coding for 
the same with the agent and then measuring the activity of PFI-007 and/or the expression thereof. 

The present invention also relates to a method of identifying agents (such as compounds, other substances or 
compositions comprising the same) that affect (such as antagonise, agonise or otherwise modify) the activity of PFI-007 

10 and/or the expression thereof, the method comprising measuring the activity of PFI-007 and/or the expression thereof in the 
presence of the agent or after the addition of the agent in: (a) a cell line into which has been incorporated recombinant DNA 
comprising the DNA sequence shown in SEQ ID NO: 1 or an allelic variation thereof, or (b) a cell population or cell line that 
naturally selectively expresses PFI-007. Preferably, the activity of PFI-007 is determined by the assay methods described 
above. 

75 The present invention also relates to a method of identifying agents (such as compounds, other substances or 

compositions comprising the same) that selectively affect (such as antagonise, agonise or otherwise modify) the activity of 
PFI-007 and/or the expression thereof, the method comprising measuring the activity of PFI-007 and/or the expression 
thereof in the presence of the agent or after the addition of the agent in: (a) a cell line into which has been incorporated 
recombinant DNA comprising the DNA sequence shown in SEQ ID NO: 1 or an allelic variation thereof, or (b) a cell 

20 population or cell line that naturally selectively expresses PFI-007. 

Preferably, the activity of PFI-007 is determined by the assay methods described above. 

The present invention also relates to a method of screening an agent for modulation (preferably for specific modulation) 
of PFI-007 (or a derivative, homologue, variant, analogue or fragment thereof) activity or the expression of the nucleotide 
sequence coding for the same (including a derivative, homologue, variant, analogue or fragment thereof), the method 

25 comprising the steps of: (a) providing a candidate agent; (b) combining PFI-007 (or the derivative, homologue, variant, 
analogue or fragment thereof) or the nucleotide sequence coding for the same (or the derivative, homologue, variant, 
analogue or fragment thereof) with the candidate agent for a time sufficient to allow modulation under suitable conditions; 
and (c) detecting modulation of the candidate agent to PFI-007 (or the derivative, homologue, variant, analogue or fragment 
thereof) or the nucleotide sequence coding for the same (or the derivative, homologue, variant, analogue or fragment 
thereof) in order to ascertain if the candidate agent modulates PFI-007 (or the derivative, homologue, variant, analogue or 

30 fragment thereof) activity or the expression of the nucleotide sequence coding for the same (or the derivative, homologue, 
variant, analogue or fragment thereof). 

The present invention also relates to a method of screening an agent for specific binding affinity with PFI-007 (or a 
derivative, homologue, variant, analogue or fragment thereof) or the nucleotide sequence coding for the same (including a 
derivative;, homologue, variant, analogue or fragment thereof), the method comprising the steps of: (a) providing a 

^ candidate ^gent; £>) combining PFItG07 (or me^ derivative, homologue, variant, analogue or fragment the/epfy or the 
nucleotide. sequence coding for the same (or the derivative, homologue. -variant, analogue or fragment thereof) with the 
candidate agent for a time sufficient to allow binding under suitable conditions; and (c) detecting binding of the candidate 
agent to PFI-007 (or the derivative, homologue, variant, analogue or fragment thereof) or the nucleotide sequence coding 
for the same (or the derivative, homologue, variant, analogue or fragment thereof) in order to ascertain if the candidate 

40 agent binds to PFI-007 (or the derivative, homologue, variant, analogue or fragment thereof) or the nucleotide sequence 
coding for the same (or the derivative, homologue, variant, analogue or fragment thereof). 

Thus, in certain embodiments of the present invention, PFI-007 or a variant, homologue, fragment, analogue or 
derivative thereof and/or a cell line that expresses the PFI-007 or variant, homologue, fragment, analogue or derivative 
thereof may be used to screen for antibodies, peptides, or other agents, such as organic or inorganic molecules, that act as 

45 modulators (e.g. antagonists or agonists) of PFI-007 activity or for the expression thereof, thereby identifying a 
therapeutic agent capable of modulating the receptor. Alternatively, screening of peptide libraries or organic libraries made 
by combinatorial chemistry with recombinantly expressed PFI-007 or a variant, homologue, fragment, analogue or derivative 
thereof or cell lines expressing PFI-007 or a variant, homologue, fragment, analogue or derivative thereof may be useful for 
identification of therapeutic agents that function by modulating the receptor. Synthetic compounds, natural products, and 

50 other sources of potentially biologically active materials can be screened in a number of ways deemed to be routine to 
those of skill in the art. For example, nucleotide sequences encoding the N-terminal region of PFI-007 may be expressed in 
a cell line, which can be used for screening of allosteric modulators, either agonists or antagonists, of PFI-007 activity. 

A PFI-007 polypeptide, its immunogenic fragments or oligopeptides thereof can be used for screening therapeutic 
compounds in any of a variety of drug screening techniques. The polypeptide employed in such a test may be free in 

55 solution, affixed to a solid support, borne on a cell surface, or located intracellular^. The formation of binding complexes 
between a PFI-007 polypeptide and the agent being tested may be measured. 

Accordingly, the present invention relates to a method for screening one or a plurality of compounds for modulation 
(preferably specific modulation, such as specific binding affinity) of PFI-007 or the expression thereof, or a portion 
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thereof or variant, homologue, fragment, analogue or derivative thereof, comprising providing one or a plurality of 
compounds; combining a PFI-007 or a nucleotide sequence coding for the same or a portion thereof or variant, homologue, 
fragment, analogue or derivative thereof with the* or each of a plurality of compounds for a time sufficient to allow 
modulation under suitable conditions; and detecting binding of a PFI-007, or portion thereof or variant, homologue, 

5 fragment, analogue or derivative thereof, to each of the plurality of compounds, thereby identifying the compound or 
compounds which modulate a PFI-007 or a nucleotide sequence coding for the same. In such an assay, the plurality of 
compounds may be produced by combinatorial chemistry techniques known to those of skill in the art. 

Another technique for drug screening provides for high throughput screening (HTS) of compounds having suitable 
binding affinity to the PFI-007 polypeptides and is based upon the method described in detail in Geysen, WO 84/03564, 

70 published on September 13, 1984. In summary, large numbers of different small peptide test compounds are synthesized 
on a solid substrate, such as plastic pins or some other surface. The peptide test compounds are reacted with PFI-007 
fragments and washed. A bound PFI-007 is then detected - such as by appropriately adapting methods well known in the 
art. A purified PFI-007 can also be coated directly onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralising antibodies can be used to capture the peptide and immobilise it on a solid support. 

15 This invention also contemplates the use of competitive drug screening assays in which neutralising antibodies capable 
of binding a PFI-007 polypeptide specifically compete with a test compound for binding a PFI-007. in this manner, the 
antibodies can be used to detect the presence of any peptide which shares one or more antigenic determinants with a PFI- 
007. 

The assay method of the present invention may be a high throughput screen (HTS). In this regard, the teachings of WO 
20 84/03564 may be adapted for the PFI-007 of the present invention. 

The teachings of US-A-5738985 may also be adapted for the assay method of the present invention. 

AGENTS 

The present invention also provides one or more agents identified by the assays methods and identification methods of 
the present invention. 

The agent of the present invention can be, for example, an organic compound or an inorganic compound. The agent can 
be, for example, a nucleotide sequence that is antisense to all or part of the sequence shown in SEQ ID NO: 1 . 

The invention further provides an agent of the present invention (or even a pharmaceutical^ acceptable salt thereof, or a 
pharmaceuticaliy acceptable solvate thereof) or a pharmaceutical composition containing any of the foregoing, for use as a 
medicament. 

The present invention also relates to the use of an agent to affect PFI-007 activity (such as to antagonise, modulate or 
agonise its GPCR activity). 

DIAGNOSTICS 

The present invention also provides a diagnostic composition for the detection of PFI-007 polynucleotide sequences. The 
diagnostic composition may comprise the sequence shown in SEQ ID NO: 1 or a variant, homologue, fragment, analogue 
or derivative thereof, or a sequence capable of hybridising to all or part of the nucleotide sequence shown in SEQ ID NO: 1 
or an allelic variation thereof. 

40 In order to provide a basis for the diagnosis of disease, normal or standard. values from a PFI-007 polypeptide expression 
should be established: This is accomplished by combining body fluids or cell extracts taken from normal subjects, either 
animal or human, with antibody to a PFI-007 polypeptide under conditions suitable for complex formation which are well 
known in the art. The amount of standard complex formation may be quantified by comparing it to a dilution series of 
positive controls where a known amount of antibody is combined with known concentrations of a purified PFI-007 

45 polypeptide. Then, standard values obtained from normal samples may be compared with values obtained from samples 
from subjects potentially affected by a disorder or disease related to a PFI-007 polypeptide expression. Deviation between 
standard and subject values establishes the presence of the disease state. 

A PFI-007 polynucleotide, or any part thereof, may provide the basis for a diagnostic and/or a therapeutic compound. 
For diagnostic purposes, PFI-007 polynucleotide sequences may be used to detect and quantify gene expression in 

50 conditions, disorders or diseases in which PFI-007 activity may be implicated. 

PFI-007-encoding polynucleotide sequence may be used for the diagnosis of diseases resulting from expression of PFI- 
007. For example, polynucleotide sequences encoding PFI-007 may be used in hybridisation or PCR assays of tissues from 
biopsies or autopsies or biological fluids, such as serum, synovial fluid or tumour biopsy, to detect abnormalities in PFI- 
007 expression. The form of such qualitative or quantitative methods may include Southern or northern analysis, dot blot or 

55 other membrane-based technologies; PCR technologies; dip stick, pin or chip technologies; and ELISA or other multiple 
sample format technologies. All of these techniques are well known in the art and are, in fact, the basis of many 
commercially available diagnostic kits. 

Such assays may be tailored to evaluate the efficacy of a particular therapeutic treatment regime and may be used in 
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animal studies, in clinical trials, or in monitoring the treatment of an individual patient. In order to provide a basis for 
the diagnosis of disease, a normal or standard profile for PFI-007 expression should be established. This is accomplished 
by combining body fluids or cell extracts taken from normal subjects, either animal or human, with PFI-007 or a portion 
thereof, under conditions suitable for hybridisation or amplification. Standard hybridisation may be quantified by comparing 

5 the values obtained for normal subjects with a dilution series of positive controls run in the same experiment where a known 
amount of purified PFI-007 <js used. Standard values obtained from normal samples may be compared with values obtained 
from samples from subjects potentially affected by a disorder or disease related to expression of the PFI-007 coding 
sequence. Deviation between standard and subject values establishes the presence of the disease state. If disease is 
established, an existing therapeutic agent is administered, and treatment profile or values may be generated. Finally, the 

10 assay may be repeated on a regular basis to evaluate whether the values progress toward or return to the normal or 
standard pattern. Successive treatment profiles may be used to show the efficacy of treatment over a period of several days 
or several months.. 

Thus, the present invention relates to the use of a PFI-007 polypeptide, or variant, homologue, fragment, analogue or 
derivative thereof, to produce anti-PFI-007 antibodies which can, for example, be used diagnostically to detect and quantify 
j 5 PFI-007 levels in disease states. 

The present invention further relates to diagnostic assays and kits for the detection of PFI-007 in cells and tissues 
comprising a purified PFI-007 which may be used as a positive control, and anti-PFI-007 antibodies. Such antibodies may 
be used in solution-based, membrane-based, or tissue-based technologies to detect any disease state or condition related 
to the expression of PFI-007 protein or expression of deletions or a variant, homologue, fragment, analogue or derivative 
thereof. 

20 

PROBES 

Another aspect of the subject invention is the provision of nucleic acid hybridisation or PCR probes which are capable of 
detecting polynucleotide sequences, including genomic sequences, encoding PFI-007 coding region or closely related 

25 molecules, such as alleles. The specificity of the probe, i.e. whether it is derived from a highly conserved, conserved or 
non-conserved region or domain, and the stringency of the hybridisation or amplification (high, intermediate or tow) will 
determine whether the probe identifies only naturally occurring PFI-007 coding sequence, or related sequences. Probes for 
the detection of related nucleic acid sequences are selected from conserved or highly conserved nucleotide regions of PFI- 
007 polynucleotides, such as the 3' region, and such probes may be used in a pool of degenerate probes. For the 

30 detection of identical nucleic acid sequences, or where maximum specificity is desired, nucleic acid probes are selected 
from the non-conserved nucleotide regions or unique regions of PFI-007 polynucleotides. As used herein, the term "non- 
conserved nucleotide region" refers to a nucleotide region that is unique to the PFI-007 coding sequence disclosed herein 
and does not occur in related sequences. 

PCR, as described in US-A-4683195, US-A-4800195 and US-A-4965188 provides additional uses for oligonucleotides 

35 based upon the PFI-007 sequence, Such oligomers ar^. generally chemical^synthesizedi but they may J»e generated* 
enzymatically or produced from a recombinant source, dtigortSers generally comprise two nucleotide sequences, one with; 
sense orientation (5'->3^ and one with antisense (3'<-S0 employed under optimised conditions for identification of a 
specific gene or condition. The same two oligomers, nested sets of oligomers, or even a degenerate pool of oligomers may 
i be employed under less stringent conditions for detection and/or quantification of closely related DNA or RNA sequences. 

40 The nucleic acid sequence for PFI-007 can also be used to generate hybridisation probes as previously described, for 
mapping the endogenous genomic sequence. The sequence may be mapped to a particular chromosome or to a specific 
region of the chromosome using well known techniques. These include in situ hybridisation to chromosomal spreads 
(Verma et at. (1988) Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York City, USA), flow- 
sorted chromosomal preparations, or artificial chromosome constructions such as yeast artificial chromosomes (YACs), 

45 bacterial artificial chromosomes (BACs), bacterial PI constructions or single chromosome cDNA libraries. 

In situ hybridisation of chromosomal preparations and physical mapping techniques such as linkage analysis using 
established chromosomal markers are invaluable in extending genetic maps. Examples of genetic maps can be found in 
Science (1995; 270:41 Of and 1994; 265:1981 f). Often the placement of a gene on the chromosome of another mammalian 
species may reveal associated markers even if the number or arm of a particular human chromosome is not known. New 

^ sequences can be assigned to chromosomal arms, or parts thereof, by physical mapping. This provides valuable 
information to investigators searching for disease genes using positional cloning or other gene discovery techniques. Once 
a disease or syndrome, such as ataxia telangiectasia (AT), has been crudely localised by genetic linkage to a particular 
genomic region, for example, AT to Hq22-23 (Gatti et al (1988) Nature 336:577-580), any sequences mapping to that area 
may represent associated or regulatory genes for further investigation. The nucleotide sequence of the subject invention 
may also be used to detect differences in the chromosomal location due to translocation, inversion, etc. between normal, 

^ carrier or affected individuals. 
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The present invention also provides a pharmaceutical composition for treating an individual in need of the same due to 
PFI-007 activity, the composition comprising a therapeutically effective amount of an agent that modulates (such as 
antagonises or agonises) said activity and a pharmaceutical^ acceptable carrier, diluent, excipient or adjuvant 
5 Thus, the present invention also covers pharmaceutical compositions comprising the agents of the present invention (an 

agent capable of modulating the expression pattern of the nucleotide sequence of the present invention or the activity of 
the expression product thereof and/or an agent identified by an assay according to the present invention). In this regard, 
and in particular for human therapy, even though the agents of the present invention can be administered alone, they will 
generally be administered in admixture with a pharmaceutical carrier, adjuvant, excipient or diluent selected with regard to 
10 the intended route of administration and standard pharmaceutical practice. 

By way of example, in the pharmaceutical compositions of the present invention, the agents of the present invention may 
be admixed with any suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), or solubilising agent(s). 

In general, a therapeutically effective daily oral or intravenous dose of the agents of the present invention is likely to 
range from 0.01 to 50 mg/kg body weight of the subject to be treated, preferably 0.1 to 20 mg/kg. The agents of the present 
15 invention may also be administered by intravenous infusion, at a dose which is likely to range from 0.001-10 mg/kg/hr. 

Thus, the present invention also provides a method of treating an individual in need of the same due to PFI-007 activity 
comprising administering to said individual an effective amount of the pharmaceutical composition of the present invention. 

Typically, the physician will determine the actual dosage which will be most suitable for an individual patient and it will 
vary with the age, weight, sex and response of the particular patient. The above dosages are exemplary of the average 
20 case. There can, of course, be individual instances where higher or lower dosage ranges are merited, and such are within 
the scope of this invention. 

Where appropriate, the pharmaceutical compositions can be administered by inhalation, in the form of a suppository or 
pessary, topically in the form of a lotion, solution, cream, ointment or dusting powder, by use of a skin patch, orally in 
the form of tablets containing excipients such as starch or lactose, or in capsules or ovu|es either alone or in admixture 
2 5 with excipients, or in the form of elixirs, solutions or suspensions containing flavouring or colouring agents, or they can be 
injected parenterally, for example intracavernosally, intravenously, intramuscularly or subcutaneously. For parenteral 
administration, the compositions may be best used in the form of a sterile aqueous solution which may contain other 
substances, for example enough salts or monosaccharides to make the solution isotonic with blood. For buccal or 
sublingual administration the compositions may be administered in the form of tablets or lozenges which can be formulated 
in a conventional manner. 

For oral, parenteral, buccal and sublingual administration to subjects (such as patients), the dairy dosage level of the 
agents of the present invention may typically be from 10 to 500 mg (in single or divided doses). Thus, and by way of 
example, tablets or capsules may contain from 5 to 100 mg of active agent for administration singly, or two or more at a 
time, as appropriate. It is also possible to administer the agents of the present invention in sustained release formulations. 

In some applications, generally in humans, oral administration of the agents of the present invention is the preferred 
route, betrig the most .©o^enient and can: in seme cashes avoid: ^sadvaritsges associated vytth other routed of 
administration - such as those associated with intracavemoSal (Lc.) admmistratipn. In circurnstances where the.fecipient 
suffers from a swallowing disorder or from impairment of drug absorption after oral administration, the drug may be 
administered parenterally, sublingually or buccally. , 

For veterinary use, the agent of the present invention is typically administered as a suitably acceptable formulation in 
accordance with normal veterinary practice and the veterinary surgeon will determine the dosing regimen and route of 
administration which will be most appropriate for a particular animal. However, as with human treatments, it may be 
possible to administer the agent alone for veterinary treatments. 

Typically, the pharmaceutical compositions - which may be for human or animal usage - will comprise any one or more 
of a pharmaceutical^ acceptable diluent, carrier, excipient or adjuvant The choice of pharmaceutical carrier, excipient, 
45 adjuvant or diluent can be selected with regard to the intended route of, administration and standard pharmaceutical 
practice. As indicated above, the pharmaceutical compositions may comprise as - or in addition to - the carrier, excipient, 
adjuvant or diluent any suitable binder(s), lubricant(s), suspending agent (s), coating agent(s) or solubilising agent(s). 

In some embodiments of the present invention, the pharmaceutical compositions will comprise one or more of: an agent 
that has been screened by an assay of the present invention; an agent that is capable of interacting with SEQ ID NO: 1 or 
50 SEQ ID NO: 2 including derivatives, fragments, homolpgues, analogues or variants thereof or sequences capable of 
hybridising to the nucleotide sequence shown in SEQ ID NO: 1. . 

Included in the scope of the invention are oligonucleotide sequences, antisense RNA and DNA molecules and 
ribozymes, which function to destabilise PFI-007 mRNA or inhibit translation of PFI-007. 
A PFI-007 antisense molecule may provide the basis for treatment of various abnormal conditions related to, for example, 
55 increased PFI-007 activity. 

Expression vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial plasmids, 
may be used for delivery of recombinant PFI-007 sense or antisense molecules to the targeted cell population. Methods 
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which are well known to those skilled in the art can be used to construct recombinant vectors containing PFI-007. 
Alternatively, recombinant PFI-007 can be delivered to target cells in liposomes. 

The full length cDNA sequence and/or its regulatory elements enable researchers to use PFI-007 as a tool in sense 
(Youssoufian H and HF Lodish (1993) Mol Cell Biol 13:98-104) or antisense (Eguchi et ai (1991) Annu Rev Biochem 60:631- 
652) investigations of gene function. Oligonucleotides, designed from the cDNA or control sequences obtained from the 
genomic DNA can be used in vitro or in vivo to inhibit expression. Such technology is now well known in the art, and sense 
or antisense oligonucleotides or larger fragments can be designed from various locations along the coding or control 
regions. Appropriate oligonucleotides, which can be 20 nucleotides in length, may be used to isolate PFI-007 sequences or 
closely related molecules from human libraries. 

Additionally, PFI-007 expression can be modulated by transfecting a cell or tissue with expression vectors which express 
high levels of a PFI-007 fragment in conditions where it would be preferable to block PFI-007 activity. Such constructs can 
flood cells with untranslatable sense or antisense sequences. Even in the absence of integration into the DNA, such 
vectors may continue to transcribe RNA molecules until all copies of the vector are disabled by endogenous nucleases. 
Such transient expression may last for a month or more with a non-replicating vector and even longer if appropriate 
replication elements are part of the vector system. 

Modifications of gene expression can be obtained by designing antisense sequences to the control regions of the PFI- 
007 gene, such as the promoters, enhancers, and introns. 

Oligonucleotides derived from the transcription initiation site, e.g. between -10 and +10 regions of the leader sequence, 
are preferred. Antisense RNA and DNA molecules may also be designed to block translation of mRNA by preventing the 
transcript from binding to ribosomes. Similarly, inhibition can be achieved using Hogeboom base-pairing methodology, also 
known as "triple helix" base-pairing. Triple helix pairing compromises the ability of the double helix to open sufficiently 
for the binding of polymerases, transcription factors, or regulatory molecules. 

Thus the invention provides a pharmaceutical composition comprising an agent of the present invention (or even a 
pharmaceutical^ acceptable salt thereof, or a pharmaceutical^ acceptable solvate thereof) together with a 
pharmaceutical^ acceptable diluent, adjuvant, excipient or carrier. 

The pharmaceutical composition could be for veterinary (i.e. animal) usage or for human usage. 

Thus, the present invention therefore also relates to pharmaceutical compositions comprising effective amounts of 
modulators (e.g. antagonists or agonists) of PFI-007 protein (including antisense nucleic acid sequences) in admixture with 
a pharmaceutical^ acceptable diluent carrier, excipient or adjuvant (including combinations thereof). 

The present invention relates to pharmaceutical compositions which may comprise all or portions of PFI-007 
polynucleotide sequences, PFI-007 antisense molecules, PFI-007 polypeptides, protein, peptide or organic modulators of 
PFI-007 bioactivity, such as antagonists (including antibodies) or agonists, alone or in combination with at least one other 
agent, such as stabilising compound, and may be administered in any sterile, biocompatible pharmaceutical carrier, 
including, but not limited to, saline, buffered saline; dextrose, and water. 

GENERAL METMOpdjCOGY REf£R£N<M& 

Although in general the techniques mentioned herein are well known in the art, reference may be made in particular to 
Sambrook etal, Molecular Cloning, A Laboratory Manual (1989) and Ausubel et a/., Short Protocols in Molecular Biology 
(1999) 4 ttI a Ed, John Wiley & Sons, Inc. PCR is described in US-A-4683195, US-A-4800195 and US-A-4965188. 

DEPOSITS 

The following sample was deposited in accordance with the Budapest Treaty at the recognised depositary The National 
Collections of Industrial and Marine Bacteria Limited (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, AB2 1RY, United 
Kingdom on 23 August 20O0: ' 

NCIMB number NCIMB 41068 is Escherichia coil Pfi-007. 

The depositor was Pfizer Central Research, Pfizer Limited, Ramsgate Road, Sandwich, Kent, CT13 9NJ, United 
Kingdom. 

One skilled in the art could readily grow the above-mentioned E. coii clone (NCIMB 41068) in Luria Broth containing 
ampicillin and isolate the plasmid DNA of the clone using the alkali lysis method as described in Sambrook, et al. t eds. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York, NY, USA. The di-deoxy 
termination method as described by Sanger et ai (Proceedings of the National Academy of Sciences (USA) (Dec. 1977), 
74(12):5463-5467) and modified by Applied Biosystems (see Applied Biosystems manufacturer's literature) for fluorescent 
detection could then be used to sequence the DNA and identify PFI-007. 

The present invention also encompasses sequences derivable and/or expressable from that deposit and embodiments 
comprising the same. The present invention also encompasses partial sequences derivable and/or expressable from that 
deposit and embodiments comprising the same, wherein those partial sequences code for active polypeptides. The present 
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invention also encompasses proteins comprising sequences derivable and/or expressable from that deposit and 
embodiments comprising the same. The present invention also encompasses proteins comprising partial sequences 
derivable and/or expressable from that deposit and embodiments comprising the same, wherein those partial sequences 
code for active polypeptides. 

5 

INTRODUCTION TO THE EXAMPLES SECTION, THE FIGURES AND THE SEQUENCE LISTING 

The present invention will now be described, by way of example only, with reference to the accompanying Figures and 
Sequence Listing in which :- 

10 Figure 1 shows quantitative PCR tissue distribution data for PFI-007 using a panel of human tissue cDNA. 
Figure 2 shows PCR tissue distribution data for PFI-007 on a cDNA panel Origene (RAPID scan). 
Figure 3 shows a ClustalW Alignment of PFI-007 with SW|P41596|DOP1_DROME DOPAMINE RECEPTOR 1 
PRECURSOR (D-DOP1). 

SEQ ID NO: 1 shows the nucleotide sequence coding for PFI-007. The ATG translation initiation codon is indicated by 
15 the first three letters. The stop codon is indicated by the last three letters. 

SEQ ID NO: 2 shows the corresponding amino acid sequence coding for PFI-007. 
SEQ ID NOS: 3-9 list the various primers used throughout the Examples. 

The polynucleotide which encodes the GPCR of the present invention was cloned and the DNA and amino acid 
sequences analysed using various bioinformatic tools. The GPCR encoded by the sequences described herein has been 
20 termed PFI-007. 

Examples 

IDENTIFICATION OF PFI-007 



25 
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PFI-007 was identified in unannotated genomic sequence information which was released by the Genome Sequencing 
Centers by searching the sequences with known members of the G-protein coupled receptor (GPCR) family using the 
BLAST algorithm. 

ISOLATION OF PFI-007 

Using the genomic sequence information, PCR primers were designed to amplify the complete open reading frame 
using 500ng of human genomic DNA as the template. The primer sequences were as follows:- 



35 Sense - y ACGATGGAATCATCTTTCTCATTTQG 3' 



1 



Antisense - 5' TTAGCCATCAAACTCTGAGCTG 3* 



PCR was earned out using the following PCR conditions:- 
94°C - 5 minutes - 1 cycle 
45 94*C- 1.5 minutes 
55°C - 1 .5 minutes 
68*C - 2.5 minutes - 35 cycles 
68*C - 10 minutes - 1 cycle 

The PCR products were cloned into pCDNA3.1 using the TOPO cloning system (Invitrogen, San Diego, CA, USA) and 
^ sequence-validated using dideoxy termination fluorescent chemistry and an ABI377 DNA sequence analyser. The resulting 
sequence-validated clone was deposited at the NCIMB (Deposit No. 41068). 

TISSUE DISTRIBUTION 

To determine the distribution of the mRNA that encodes PFI-007, a number of approaches were performed. 

55 

(a) Quantitative PCR tissue distribution using a panel of human tissue cDNA and Taqman ABI 7700 technology 
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Primers and Taqman probe sequences were identified using the primer express software version 1.0 (Applied 
Biosystems). The sequences used are detailed below:- 

Ftaq- 5' CCATGGCTGGAGGTTATCGA 3' (PFI-007 Forward TaqMan primer) 



Rtaq- 5'GCTCCCAATGAGAACAGACACA 3* (PFI-007 Reverse TaqMan primer) 

10 

CP-5' FAM-CCCCACGGACTCCCAGCGACTT-TAMRA 3' (PFI-007 Coding strand probe, 
15 FAM reporter TAMRA quencher; = PF1-007CP) 




All methods used were as per manufacturers' protocols, except for the following modifications:- 
A standard curve reaction was used to quantify PFI-007 expression using a 1.48 ug/ul plasmid solution of pcDNA3.1 
20 containing PFI-007. The plasmid solution was serially diluted in log steps to 10~ 12 a. The standard curve was set up using 
this dilution for 2-200,000 copies/well. 300 nM of each primer and 150 nM of PFI-007CP were used with 2x TaqMan® 
Universal PCR Master Mix (Applied Biosystems). 1 ng of cDNA from Multiple Tissue cDNA (MTC™a) Panels; Human I and 
Human II (Clontech) was used as the template. The reaction was run on the TaqMan® 7700 Sequence Detection System 
as a single reporter, real time quantitation with 25 pi reaction volume and every point in triplicate. PCR conditions were, 
25 50°C for 2 min, 95*C for 10 min (95*C for 15 sec, 60'C for 1 min) x 40 cycles. The Data was analysed by the sequence 
detection software. Baseline 3-18 cycles. The results file was exported to Microsoft Excel and the standard curve 
regenerated with the Detection parameters from the results file. 

The results (see Figure 1) show that PFI-007 is expressed predominately in the pancreas (10 copies) and expressed at 
lower levels in testis and ovary (-1 copy) and not detectable elsewhere. 

30 

(b) PCR tissue distribution on a cDNA panel Origene (RAPID scan) including human isolated pancreatic Islet cells. 

PCR tissue distribution on a cDNA panel Origene (RAPID scan) including human isolated pancreatic Islet cells was 
carried out using the following primers and the PCR conditions as described above. 

35 RT-sense5* ^ 



RT-antisense 5' TAGGCATAGATGAGTGGGTTGAGC 3' 



As can be seen in Figure 2, an expected fragment size of 500 bp was obtained and order of relative abundance was 
45 scored: Human pancreatic Islet cells = fetal liver > pancreas > testis > peripheral blood leukocytes. 



BIOINFORMATIC STUDIES 



50 



In order to confirm that PFI-007 was a member of the GPCR family, a number of bioinformatics approaches were 
performed. 

(a) BLAST Search against Swissprot 



PFI-007 was searched against Swissprot using the BLAST algorithm (Basic Local Alignment Search Tool (Aftshul SF 
(1993) J.Mol. EvoL 36:290-300; Altshul, SF et al (1990) J. Mol. Biol. 215:403-410)) to identify the closest protein match. 
In this case the top hit was to: 

SW|P41596|DOP1_DROME DOPAMINE RECEPTOR 1 PRECURSOR (D-DOP1). 
These results indicate that PFI-007 is a member of the GPCR family. 
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« 

(b) ClustalW Alignment of PFI-007 with SW|P41596|DOP1_DROME DOPAMINE RECEPTOR 1 PRECURSOR (D-DOP1) 

These results are shown in Figure 3. 

5 

(c) BLAST search against a non-redundant human GPCR database 

PFI-007 was searched against a non-redundant human GPCR database comprising mainly sequences from Genbank 
and Geneseq Patents databases in order to identify the class of agonist for this receptor. The top ten hits are shown 
10 below: 

Y08756 5-HT-4 :- Expect = 6e-24 Identities = 99/354 (27%) 
P07550 B2AR :- Expect = 6e-24 Identities = 80/297 (26%) 
P50406 5-HT6 Expect = 2e-23 Identities = 85/304 (27%) 
P21728 D1DR :- Expect = 5e-23 Identities = 84/320 (26%) 
P30542 AA1R :- Expect = 5e-22 Identities = 77/292 (26%) 
P08588 B1AR :- Expect = 7e-21 Identities = 77/320 (24%) 
P20521 HH2R :- Expect = 1e-20 Identities = 70/283 (24%) 
L36150 GPR6 :- Expect = 1e-19 Identities = 80/289 (27%) 
P29274 AA2A :- Expect = 2e-1 9 Identities = 90/308 (29%) 

P21918 D5DR :- Expect = 3e-1 9 Identities = 82/322 (25%). (e value = statistical likelihood of the hit occurring by chance) 
These results demonstrate that PFI-007 is potentially an aminergic receptor and that PFI-007 encodes a novel GPCR 

whose ligand is a small molecule, such as an amine. 

It will be appreciated that the foregoing is provided by way of example only and modification of detail may be made 

without departing from the scope of the invention. 

25 
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SEQUENCE LISTING 
(which forms part of the description) 



SEQ ID NO; 1 



Nucleotide sequence coding for PFI-007 



ATGGAATCATCTTTCTCATTTGGAGTGATCCTTGCTGTCCTGGCCTCCCTCATCATT 
GCTACTAACACACTAGTGGCTGTGGCTGTGCTGCTGTTGATCCACAAGAATGATGGT 
GTCAGTCTCTGCTTCACCTTGAATCTGGCTGTGGCTGACACCTTGATTGGTGTGGCC 
ATCTCTGGCCTACTCACAGACCAGCTCTCCAGCCCTTCTCGGCCCACACAGAAGACC 
CTGTGCAGCCTGCGGATGGCATTTGTCACTTCCTCCGCAGCTGCCTCTGTCCTCACG 
GTCATGCTGATCACCTTTGACAGGTACCTTGCCATCAAGCAGCCCTTCCGCTACTTG 
AAGATCATGAGTGGGTTCGTGGCCGGGGCCTGCATTGCCGGGCTGTGGTTAGTGTCT 
TACCTCATTGGCTTCCTCCCACTCGGAATCCCCATGTTCCAGCAGACTGCCTACAAA 
GGGCAGTGCAGCTTCTTTGCTGTATTTCACCCTCACTTCGTGCTGACCCTCTCCTGC 
GTTGGCTTCTTCCCAGCCATGCTCCTCTTTGTCTTCTTCTACTGCGACATGCTCAAG 
ATTGCCTCCATGCACAGCCAGCAGATTCGAAAGATGGAACATGCAGGAGCCATGGCT 
GGAGGTTATCGATCCCCACGGACTCCCAGCGACTTCAAAGCTCTCCGTACTGTGTCT 
GTTCTCATTGGGAGCTTTGCTCTATCCTGGACCCCCTTCCTTATCACTGGCATTGTG 
CAGGTGGCCTGCCAGGAGTGTCACCTCTACCTAGTGCTGGAACGGTACCTGTGGCTG 
CTCGGCGTGGGCAACTCCCTGCTCAACCCACTCATCTATGCCTATTGGCAGAAGGAG 
GTGCGACTGCAGCTCTACCACATGGCCCTAGGAGTGAAGAAGGTGCTCACCTCATTC 
CTCCTCTTTCTCTCGGCCAGGAATTGTGGCCCAGAGAGGCCCAGGGAAAGTTCCTGT 
CAGATCGTCACTATCTGCAGCTC^ 



SEQ ID NO: 2 



Amino acid sequence coding for PFI-007 



MESSFSFGVILAVLASL1IATNTLVAVAVLLLIHKNDGVSLCFTLNLAVADTLI 

GVA1SGLLTDQLSSPSRPTQKTLCSLRMAFVTSSAAASVLTVMLITFDRYLAIK 

QPFRYLKIMSGFVAGAC1AGLWLVSYLIGFLPLGIPMFQQTAYKGQCSFFAVF 

HPHFVLTLSCVGFFPMLLFVFFYCDMLKJASMHSQQTRKMEHAGAMAGGYRS 

PRTPSDFKALRTVSVLIGSFALSWTPFLlTGrVQVACQECHLYLVLERYLWLLG 

VGNSLLNPL1YAYWQKEVRLQLYHMALGVKKVLTSFLLFLSARNCGPERPRE 

SSCHTVTISSSEFDG* 
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SEQ ID NO. 3 

ACCATGGAAT CATCTTTCTC ATTTGG 
SEQ ID NO. 4 

TTAGCCATCA AACTCTGAGC TG 
SEQ ID NO. 5 

CCATGGCTGG AGGTTATCGA 
SEQ ID NO. 6 

GCTCCCAATG AGAACAGACA CA 
SEQ ID NO. 7 

CCCCACGG&C TCC£AGCSaC TT 
SEQ ID NO. 8 

GCCGGGCTGT GGTTAGTGTC 
SEQ ID NO. 9 

TAGGCATAGA TGAGTGGGTT GAGC 
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Claims 

1. An isolated and/or purified polynucleotide comprising one or more of: 

(a) a polynucleotide encoding the polypeptide as set forth in SEQ ID NO: 2; 

(b) a polynucleotide comprising a nucleotide sequence of SEQ ID NO: 1; 

(c) a polynucleotide encoding the polypeptide expressed by the DNA contained in NCIMB 41068; 

(d) a polynucleotide comprising a nucleotide sequence that has at least 70% identity to the polynucleotide of any 
one of (a) to (c); 

(e) a polynucleotide comprising a nucleotide sequence which is capable of hybridising to the polynucleotide of any 
15, one of (a) to (d); 

(f) a complement to the polynucleotide of any one of (a) to (e); or 

(g) a polynucleotide fragment of the polynucleotide of any one of (a) to (f). 

20 

2. The polynucleotide of claim 1 , comprising a nucleotide sequence that has at least 75% identity to the polynucleotide of 
any one of (a) to (c). 

25 3. The polynucleotide of claim 1 , comprising a nucleotide sequence that has at least 80% identity to the polynucleotide of 
any one of (a) to (c). 

4. The polynucleotide of claim 1 , comprising a nucleotide sequence that has at least 85% identity to the polynucleotide of 
any one of (a) to (c). 
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5. The polynucleotide of claim 1 , comprising a nucleotide sequence that has at least 90% identity to the polynucleotide of 
any one of (a) to (c). 

6. The polynucleotide of.claim 1 , comprising a nucleotide sequence that has at least 95% identity to the polynucleotide of 
any one of (a) to (c). 

7. The polynucleotide of any one of the prec^rig&laims which encodes a G-protein coupled receptor (GPCR). 

8. A polynucleotide probe or primer comprising at least 1 5 contiguous nucleotides of the polynucleotide of any one of the 
preceding claims. 



9. A vector comprising the polynucleotide of any one of the preceding claims. 

10. A host cell transformed or transfected with the vector of claim 9. 

45 11. The transformed/transfected host cell of claim 10 which is a mammalian, insect, fungal, bacterial or yeast cell. 

12 The transcribed RNA product of the polynucleotide of any one of claims 1 to 8. 



13. An RNA molecule or a fragment thereof which is antisense in relation to the RNA product of claim 12 and is capable 
of hybridising thereto. 

14. A ribozyme or zinc finger protein capable of binding to the polynucleotide of any one of claims 1 to 8. 

15. A process for producing a polypeptide or fragment thereof comprising culturing the transformed/transfected host cell 
of claim 10 or claim 1 1 under conditions sufficient for the expression of said polypeptide or fragment. 

16. The process of claim 1 5, wherein said polypeptide or fragment is expressed at the surface of said cell. 
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17. The process of claim 15 or claim 16 which further includes recovering the polypeptide or fragment from the culture. 

18. A process for producing cells capable of expressing a polypeptide or fragment thereof comprising transforming or 
transfecting cells with the vector of claim 9. 

19. Cells produced by the process of claim 18. 

20. A membrane preparation of the cells of claim 19. 

21. A polypeptide or a fragment thereof produced by the process of any one of claims 1 5 to 18. 

22. A polypeptide comprising: 

(a) a polypeptide having the deduced amino acid sequence translated from the polynucleotide sequence in SEQ 
ID NO: 1 and variants, fragments, homologues, analogues and derivatives thereof; 

(b) a polypeptide of SEQ ID NO: 2 and variants, fragments, homologues, analogues and derivatives thereof; or 

(c) a polypeptide encoded by the cDNA of NCIMB 41068 and variants, fragments, homologues, analogues and 
derivatives thereof. 



23. An antibody against the polypeptide of claim 22. 

24. A compound which modulates the polypeptide of claim 22. 

25. A compound according to claim 24 which antagonises or selectively antagonises the polypeptide of claim 22. 

26. A compound according to claim 24 which agonises the polypeptide of claim 22. 

27. A pharmaceutical composition comprising the antibody of claim 23 or the compound of any one of claims 24 to 26 and 
one or more pharmaceutical^ acceptable carriers, diluents, adjuvants or excipients. 

28. A method for identifying a compound which binds to and modulates the polypeptide of claim 22 comprising contacting 
said polypeptide with a candidate compound and determining whether modulation occurs. 

29. A mefiod^ocp^ding; to claim 28, which comprises: 

(a) contacting a compound with cells expressing the polypeptide of claim 22 on their surface, said polypeptide 
being associated with a second component capable of providing a detectable signal in response to the binding of 
a compound to said polypeptide; said contacting being under conditions sufficient to permit binding of compounds 
to the polypeptide; and 

(b) identifying a compound capable of polypeptide binding by. detecting the signal produced by said second 
component. 



30. A method according to claim 29, which comprises: 

(a) contacting (i) a detectable first component known to bind to the polypeptide of claim 22 and (ii) a compound, 
with cells expressing the polypeptide of claim 22 on their cell surface, said polypeptide being associated with a 
second component capable of providing a detectable signal in response to the binding of a compound to said 
polypeptide; said contacting being under conditions sufficient to permit binding of compounds to the polypeptide; 
and 

(b) determining whether the first component binds to the polypeptide by detecting the absence or otherwise of a 
signal generated from the interaction of the first component with the polypeptide. 

31. A method according to any one of claims 28 to 30, wherein said compound binds to and antagonises or selectively 
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antagonises the polypeptide of claim 22. 

32. A method according to any one of claims 28 to 30, wherein said compound binds to and agonises the polypeptide of 
claim 22. 

* 

5 

33. The antibody of claim 23, the compound of any one of claims 24 to 26, or the composition of claim 27 for use as a 
pharmaceutical. j 

34. Use of the compound of any one of claims 24 to 26 in the manufacture of a medicament for the treatment of a patient 
1Q having need to modulate the polypeptide of claim 22. 

35. Use according to claim 34, in the manufacture of a medicament for the treatment of a patient having need to 
antagonise or selectively antagonise the polypeptide of claim 22. 



/5 



36. Use according to claim 34, in the manufacture of a medicament for the treatment of a patient having need to agonise 
the polypeptide of claim 22. 

37. A method for the treatment of a patient having need to modulate the polypeptide of claim 22 comprising administering 
to the patient a therapeutically effective amount of the compound of any one of claims 24 to 26. 

20 38. The method according to claim 37, wherein the method is for the treatment of a patient having need to antagonise or 
selectively antagonise the polypeptide of claim 22. 



25 



39. The method according to claim 37, wherein the method is for the treatment of a patient having need to agonise the 
polypeptide of claim 22. 

40. The method according to any one of claims 37 to 39, wherein said compound is a polypeptide and a therapeutically 
effective amount of the compound is administered by providing to the patient DNA encoding said compound and 
expressing said compound//? vivo. 

30 41. Use of the antibody of claim 23 in the manufacture of a medicament for the treatment of a patient having need to 
modulate the polypeptide of claim 22. 

42. Use according to claim 41 , in the manufacture of a medicament for the treatment of a patient having need to 
antagonise or selectively antagonise the polypeptide of claim 22. 

■|5 - ' ■ <v • • , ; 

43. Use;accbrding; to'claim 41, in the manufacture of a medicament for the-teaiment M a patient having need to agonise 

the polypeptide of claim 22. 

44. A method for the treatment of a patient having need to modulate the polypeptide of claim 22, comprising 
administering to the patient a therapeutically effective amount of the antibody of claim 23. 

45. The method according to claim 44, wherein the method is for the treatment of a patient having need to antagonise or 
selectively antagonise the polypeptide of claim 22. 

46. The method according to claim 44, wherein the method is for the treatment of a patient having need to agonise the 
45 polypeptide of claim 22. 

47. Use of the compound of any one of claims 24 to 26 in the manufacture of a medicament for the treatment of metabolic 
disease. 

50 48. Use of the compound of any one of claims 24 to 26 in the manufacture of a medicament for the treatment of diabetes. 

49. Use of the antibody of claim 23 in the manufacture of a medicament for the treatment of metabolic disease. 

50. Use of the antibody of claim 23 in the manufacture of a medicament for the treatment of diabetes. 
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51. A method for the treatment of metabolic disease in a patient comprising administering to the patient a therapeutically 
effective amount of the compound of any one of claims 24 to 26. 
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52. A method for the treatment of diabetes in a patient comprising administering to the patient a therapeutically effective 
amount of the compound of any one of claims 24 to 26. 



53. A method for the treatment of metabolic disease in a patient comprising administering to the patient a therapeutically 
effective amount of the antibody of claim 23. 

54. A. method for the treatment of diabetes in a patient comprising administering to the patient a therapeutically effective 
amount of the antibody of claim 23. ^ 

55. Cells genetically engineered ex vivo or in vivo to overexpress, underexpress or to exhibit targeted insertion or 
deletion of the polypeptide of claim 22. 



56. 



Non-human animal comprising cells according to any one of claims 1 0, 1 1 , 1 9 and 55. 
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Figure 2 
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Figure 3 



CLUSTAL W (1.74) multiple sequence alignment 

DOP1_DROMB MYTP H P FG FL 1 1 LVPMTN AMRAI AAI AAG VG S VAATV ATS TTS S I SS S TT 1 1 NTS S ATT I 
temp 

DOPl_DROME GGNHTSGSTGFSTNSTLLDADHLPLQLTTAKVDLDIEIDIQLLTNGYDGTTLTSFYNESS 
temp , 

DOP1_DROME WTOASEMDT I VGEE P E PLS LVS I WVG I FLS VL I F LS VAGN I LVC LA I YTDG - SLPRIGN 

temp - MESS FSFGV ILAVLASL I IATNTLVAVAVLLL I H KNDGVS L 

:*..*;:*:** *:****.:*: 

DOP1_DROME LFLASLAIADLFVASLVMTFAGVNDLLGYWI FGAQ - - FCDTWVAFDVMCSTAS I LNLCAI 

temp CFTLNIiAVADTLIG- -VAISGliTDQLSSPSRPTQKTLCSLRMAFVTSSAAASVLTVMLI 

* .**:*■* * :.**. :* :*. :** .::**:*.: * 

DOPl_DROME SMDRYIHIKDPLRYGRWVTRRVAVITIAAIWLLAAFVSFVPISLGIHRPDQPLIFEDNGK 

temp TFDRYLAIKQPFRYLKXMSGFVAGACI AGLWLVSYLIGFLP --LGI--P MFQQTAY 

**.*.♦* . ** **..**. j *** * ;*::.. 

DOPl_DROME KYPTCAIj- -DLTPTYAVVSSCISFYFP-CVVMIGIYCRLYCYAQKHVKSIKAVTRPGEVA 

temp KG-QCSFFAVFHPHFVLTLSCVGF- FPAMLLFVFFYCDMLKIASMHSQQIRKMEHAGAMA 

* * : : : * ** r . * ** :::: :** : *. * :.*: : :,* :* 

DOP1DROME EKORYKSIRRPKNQPKJCFKVRNl^THSSPYHVSDHKAAVTVGVIMGVFLICWPFFCVNI 
temp GG--Y-- -RSP- - - R TP - - - SDFKALRTVSVLIGSFALSWTPFLITGI 

DOPl_DROME TAAFCKTCIGGQTFKI LTWL-GYSNSAFNPI IYSI FNKEFRDAFKRILTMRNPWCCAQDV 

temp VQVACQECmYLVLERYLWLLGVGNSLLNPLIYAYWQKE\miX3LY-HMALGVKKVLTSFL 

*:* .:: ***,**;#*;#*...**_* - ... :. : 

DOPl_DROME GN IHPRNSDR F ITD YAAKNVWMN SGRS S AELEQVS AI 

temp LFLSARNCGPERPRESSCHIVTI SSSEFDG 

: . **:*:: 
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